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Sir: 



RE- 



This Brief is filed in response to the final Office Action mailed January 31, 1995, 
regarding the above-captioned application. This brief is due July 31, 1995, by virtue of the 
Notice of Appeal filed on May 31, 1995. 



L STATUS OF THE CLAIMS 

Claims 1-67 were filed with the application, and various claims have been withdrawn, 
and various claims amended, during the course of prosecution. Claims 2-4, 47 and 50-58 and 
60-67 are currently pending and are the subject of the present appeal. A copy of tfrese pehding 
claims is attached as Exhibit A. 

H. STATUS OF THE AMENDMENTS 

An amendment after final was sought in an amendment filed on June 12, 1995, and these 
amendments were introduced per an Advisory Action dated July 27, 1995. 

A further amendment is sought concurrently with the filing of the present Brief. It is 
believed that these amendments should be acceptable to the Examiner as only claims and species 
have been removed, and no new issues presented. 

m. REAL PARTIES IN INTEREST 

The real party in interest in the present appeal is DeKalb Genetics Corporation, assignee 
of the captioned application. 

IV. RELATED APPEALS AND INTERFERENCES 

An appeal is currently pending in a related case, USSN 07/565,844. 
A notice of appeal has been filed in USSN 08/112,245. 



V. SUMMARY OF THE INVENTION 

The invention of the claims pending in this Appeal relates to fertile, transgenic maize 
(corn) plants having a selected, specified "transgene" introduced into their genetic makeup. 
There is no single claim pending that is generic to all of the pending claims, and the claims are 
instead directed to a large number of what are submitted to be patentably distinct species. In 
particular, the independent claims on appeal are directed to fertile, transgenic corn plants having 
one or more of the following genes inserted into their genome: 

Claim 47 - Selectable or Screenable Marker Genes 
an aequorin gene 

a gene encoding a cell wall protein 
an HPRG gene 

Claim 60 -- Negatively Selectable Marker Genes 

a cytosine deaminase gene 
a T-DNA gene 2 
an antisense bar gene 
an antisense nptU gene 

Claim 61 - Inducible or Tissue Specific Promoter or Enhancers 

an a-tubulin promoter 
an ocs promoter 
an ABA-inducible promoter 
a turgor-inducible promoter 

Claim 62 — Herbicide Resistance Genes 

a bxn gene 
Claim 63 -- Insect Resistance Genes 

an oryzacystatin gene 



a wheat or barley amylase inhibitor gene 
a lipoxygenase gene 

an ecdysteroid UDP-glucosyl transferase gene 
a DIMBOA synthetic gene of the bx locus 

Claim 64 - Disease Resi stance Genes 

a pathogenesis related (PR) protein gene 

Claim 65 — Stress Resistance Genes 

a glycerol-3-phosphate acetyltransferase gene 
a superoxide dismutase gene 
a glutathione reductase gene 

Claim 66 - Drought Resistance Genes 

a mannitol-1 -phosphate dehydrogenase gene 
a trehalose-6-phosphate synthase gene 
a myoinositol O-methyltransferase gene 
a Late Embryogenic Protein (LEA) gene 

Claim 67 — Grain Composition Genes 

an acetyl-CoA carboxylase gene 

an ACP-acyltransferase gene 

a b-ketoacyl-ACP synthase gene 

an acyl carrier protein gene 

a fatty acid desaturase gene 

a fatty acid epoxidase gene 

a fatty acid hydratase gene 

a fatty acid dehydratase gene 

a sense or antisense phytoene synthase gene 

a sense or antisense phytoene desaturase gene 

a sense or antisense lycopene synthase gene 

a phytase gene 

an ADP-glucose pyrophosphorylase gene 

a starch synthase gene 

a starch branching enzyme gene 

a sucrose synthase gene 
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The remaining claims specify cells, seed and progeny, particular preferred promoters or 
other control elements, or breakout individual species. 

It is Appellants' position that the more than 45 different species of corn plants 
represented by the above independent claims on appeal are separate and patentably distinct. 

VI. ISSUES ON APPEAL 

It is believed that the following three rejections are the only issues that remain for appeal: 

(1) The rejection of all the claims on the basis of provisional obviousness-type double 
patenting over copending USSN 07/508,045, filed April 11, 1990 (the '045 
specification, attached as Exhibit B); 

(2) The rejection of all of the claims as obvious over the '045 application; and 

(3) The rejection of all of the claims as obvious over the Goldman et al. patent 
(hereinafter "Goldman"), US 5,187,073 (Exhibit C) alone or in combination with 
what the Examiner states are Appellants' acknowledged state of the art (see Office 
Action of 6/28/94, page 7). 

VH. GROUPING OF THE CLAIMS 

The claims will stand or fall separately. It is Appellants position that a fertile, transgenic 
maize plant bearing one of the recited genetic elements is patentably distinct from a second 
transgenic maize plant bearing another of the recited genetic elements. The subject matter of 
each claim must be judged in accord with the applicable standards of obviousness, and the claims 
are patentably distinct absent proof of obviousness. Even then, the claimed corn plants with 



distinct genes would be considered obvious only where secondary indicia of non-obviousness are 
absent. 

Vffl. ARGUMENT 

A. PROVISIONAL REJECTION OF CLAIMS ON THE BASIS OF 
OBVIOUSNESS-TYPE DOUBLE PATENTING OVER THE CLAIMS OF 
THE '045 APPLICATION 

SUMMARY OF THE REJECTION 

The Action first rejects all of the claims on the basis of obviousness-type double patenting 
over claims 28-68 of the '045 application (see Exhibit B). The Examiner takes the position that 
the present claims are directed to genes and genetic elements previously known in the art, and 
that the introduction of these genes into maize (corn) is obvious in light of the claims of the '045 
application. 

SUMMARY OF APPELLANTS 9 ARGUMENT 

In response, it is respectfully submitted that the subject matter of the claims pending in 
the present appeal is distinct from the subject matter of the '045 application. The claims on 
appeal are directed to transgenic maize bearing particular genes that are novel and nonobvious 
in the context of fertile, transgenic corn plants: There is submitted to be no disclosures in the 
'045 specification that teaches, suggests or in any way motivates one of skill in the art to 
produce the particular transgenic corn plants of the pending claims, and no evidence in the prior 
art that the specified genetic elements would function in corn to achieve a useful result. 



The Action fails to base its finding upon a teaching, suggestion or motivation to employ 
each of the claimed genes ~ a prerequisite to a finding of obviousness-type double patenting. 
Carmen Industries, Inc. v. Wahl, 220 U.S.P.Q. 481, 487 (Fed. Cir. 1983); Mirafi Inc. v. 
Murphy, 14 U.S.P.Q. 1337, 1347 (N.C. 1989). To maintain the rejection it must be shown that 
the subject matter of the later claims are obvious over the claims of the earlier application. It 
is submitted that no such demonstration has been made here, and thus no prima facie rejection 
has been made. 

THE EXAMINER HAS NOT MADE OUT A PRIMA FACIE CASE OF 
OBVIOUSNESS-TYPE DOUBLE PATENTING 

A rejection on the basis of obviousness-type double patenting is treated in much the same 
way as obviousness rejections under 35 U.S.C. § 103. In order to make out a prima facie 
rejection, an examiner must demonstrate that subject claims of the later patent are obvious in 
light of the claims of the earlier patent. In re Stanley, 102 U.S.P.Q. 234 (CCPA 1954). 
Moreover, where the earlier patent discloses a genus and certain species, and the subject claims 
are directed to species not specifically disclosed in the earlier patent, the claims of the subject 
patent are patentably distinct absent evidence that the specific later species are obvious. In re 
Sarett 9 140 U.S.P.Q. 474 (CCPA 1964). 



1. An Earlier Generic Patent Does Not Render prima facie Obvious Claims to 
a Later Species Under Double Patenting 

The proposition that an earlier generic patent and a later species/improvement patent does 

not raise obviousness-type double patenting concerns has been more recently reconfirmed by the 

Federal Circuit in the case of In re Kaplan, 229 U.S.P.Q. 678 (Fed. Cir. 1986), the court citing 

to E. Stringham's article Double Patenting: 

One of the simplest, clearest, soundest and most essential principles of patent law, 
is that later invention may be validly patented, altho [sic] dominated by an earlier patent, 
whether to the same or to a different inventor. 

229 U.S.P.Q. at 682. 

Thus, the fact that the '045 claims may dominate the subject claims is totally irrelevant 
to whether the present claims are properly subject to an obviousness-type double patenting 
rejection. The fact remains: the examiner must make out a prima facie case that the present 
claims are obvious in light of the claims of the earlier application. It is submitted that the 
Examiner has failed to make out a proper prima facie case of obviousness. 

The present claims are said to be obvious over those of the earlier-filed application (or 
claims) based on the fact that the earlier application was directed generically to "fertile, 
transgenic corn" transformed with any selected gene and the present application is directed to 
fertile, transgenic corn comprising one of 50 or so selected specific genes not mentioned in the 
'045 case. While it is true that the genes were known to exist per se 9 Appellants are unaware 
that these genes have been successfully placed into corn and there is no art relied upon by the 
Examiner to teach or suggest fertile, transgenic corn genetically engineered to express the 
specified genes. 
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2. A Prior Art Genus Does Not Render prima facie Obvious a Later Species 

The examiner is relying merely upon a prior genus to obviate presently claimed species . 
It is submitted that obviousness of the present claims in light of the prior genus is governed by 
the Federal Circuit's holdings in In re Jones, 21 U.S.P.Q.2d 1941 (Fed. Cir. 1992) and In re 
Baird, 29 U.S.P.Q.2d 1550 (Fed. Cir. 1994). These cases held that a prior art genus does not 
render prima facie obvious a later species within that genus. 

In Jones, the PTO Solicitor argued the prevailing PTO view that a prior art genus 
rendered prima facie obvious a later claim to a species within that genus. It was at that time 
routine practice in the PTO examining corps to base prima facie obviousness rejections on the 
existence of a prior art genus, thus shifting the burden to applicants to come forward with 
evidence of non-obviousness. The claim in Jones was directed to a new salt of the previously 
known herbicide "dicamba." The Solicitor argued that since "dicamba" per se was known in 
the art, and that the particular salt cation had been previously known, that the dicamba salt 
formed with the known cation was prima facie obvious. The court disagreed, and held that no 
prima facie rejection had been made, relying on the large number of species embraced by the 
prior art genus and the absence of any specific motivation to use the particular salt cation in the 
context of dicamba. 21 U.S.P.Q.2d at 1943. 

The Jones court specifically rejected the Solicitor's argument that one of skill in the art 
would have been motivated to use the particular salt cation with dicamba since structurally 
similar salts had previously been prepared and were in the prior art. In rejecting the Solicitor's 
argument, the court observed that: 



Before the PTO may combine the disclosures of two or more prior art references in order 
to establish prima facie obviousness, there must be some suggestion for doing so . found 
either in the references themselves or in the knowledge generally available to one of 
ordinary skill . . . 

Conspicuously missing from this record is any evidence, other than the PTO's 
speculation (if it be called evidence) that one of ordinary skill in the herbicidal art would 
be motivated to make the modifications in the prior art salt necessary to arrive at the 
[claimed] salt, (underlining added) 

21 U.S.P.Q. at 1943-44. 

The Federal Circuit has recently reaffirmed the Jones holding in the case of In re Baird, 
29 U.S.P.Q.2d 1550 (Fed. Cir. 1994). The Baird court followed Jones, stating that "[a] 
disclosure of millions of compounds does not render obvious a claim to three compounds". 29 
U.S.P.Q.2d at 1552. The court again considered it particularly relevant that the prior art genus 
was quite broad, as in the present case, and focused on the fact that there was no specific 
motivation in the evidence or record for preparing the particular species of the subject claims. 1 

These facts of Jones are consistent with those of the present case. The prior '045 claims 
are generic claims (and some distinct species claims) directed fertile, transgenic plants. In 
contrast, the present claims are directed to specific corn plants bearing specific distinct 
introduced genes. Just as in Jones, it is agreed that the genes one would use to prepare the 
presently claimed transgenic corn species were known. However, also just as in Jones, the 
Examiner here has presented no evidence to demonstrate that one of skill in the art would be 



1 It is of interest to note that in March 1994 following the decision in Baird, Commissioner 
of Patents Lehman distributed a directive to examiners, instructing them to disregard the Baird 
decision in making prima facie rejections of species embraced by prior generic disclosures. See 
1161 OG 314 (4/14/94). However, in an April 17, 1995 notice to examiners, the Commissioner 
rescinded the earlier notice, agreeing that Baird was good law. See 50 PTCJ 3 (May 4, 1995). 
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motivated to introduce the particular genes set forth in the claims into corn, and no evidence that 
one of skill would have predicted, when the present application was filed, that such genes would 
function appropriately in the resultant engineered corn. 

THE EXAMINER HAS NOT MET THE STANDARD OF JONES AND BAIRD 

The Examiner's only statement on the record regarding this obviousness-type double 

patenting rejection is set forth in the Office Actions of 6/28/94 and 4/17/95. In the 6/28/94 

Action where the rejection was first entered, the Examiner stated merely that: 

Although the conflicting claims are not identical, they are not patentably distinct from 
each other because each application is drawn to fertile transgenic maize in which the 
genome is augmented by the addition of DNA not normally found in maize or if found 
in maize is inserted, modified, altered or otherwise manipulated to the extent that it 
afford/s a change that is detectable over maize not transformed by phenotypic or 
genotypic change. Each application differs by the specific recitation of the DNA of 
interest which is deemed to be experimenter choice. 

It is submitted that such a statement, with nothing more, fails to raise a prima facie case 
of obviousness-type double patenting under the standards set forth in Sarett, Jones and Baird. 
The mere recitation of a prior art genus does not meet these standards, 

The Final Action of 4/17/95 observes that certain genetic starting materials were known 
in the art at the time the present invention was made. The Action argues that if the prior art 
genetic elements function in a predictable manner when placed into corn, then a prima facie case 
of obviousness has been shown. The Action has confused the applicable standards: The issue 
is not whether the prior art genetic elements actually function in corn, the issue is whether the 
prior art provides a proper motivation and evidences in advance that the invention would be 
successful. 
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Appellants respond by noting that no specific teaching have been pointed to by the 
Examiner. Instead, the Action merely posits that Appellants' specification "admits" that the 
starting materials were known. Appellants' specification merely teaches how one would obtain 
useful starting materials to prepare the present invention. It is submitted that Appellants' 
specification is the first to provide the requisite motivation to obtain these particular genetic 
starting materials and introduce them into corn. The Examiner's reliance upon Appellants' 
specification to provide the missing motivation constitutes impermissible hindsight reconstruction 
of the invention. In reDeminski, 230 U.S.P.Q. 313 (Fed. Cir. 1986). 

THE STANDARD OF OBVIOUSNESS UNDER 
O'FARRELL AND VAECK HAVE ALSO NOT BEEN MET 

Appellants would further request the Board to consider the case of In re O'Farrett, 7 

U.S.P.Q.2d 1673, 1680 (Fed. Cir. 1988), which held that in order for a prior art teaching to 

obviate an invention, it must be shown that the teaching contains: 

(1) detailed enabling methodology for practicing the claimed invention; 

(2) a suggestion for modifying the prior art to practice the claimed invention; and 

(3) evidence suggesting that the invention would be successful. 

In the more recent case of In re Vaeck, 20 U.S.P.Q. 1438 (Fed. Cir. 1991), the Federal 
Circuit took the O'Farrett doctrine a step further. In Vaeck the Federal Circuit stated that in 
order for an examiner to make out a prima facie case of obviousness two things must be shown: 
1) that the prior art would have suggested to those of ordinary skill in the art that they should 
make the claimed composition; and 2) that the prior art must demonstrate a reasonable 
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expectation of success of the invention. Both the suggestion and the reasonable expectation of 
success must be founded in the prior art, not in the applicant's disclosure. 

1. There is No Evidence of Record that the Prior Art Would Predict that these 
Genetic Elements Would Function in Genetically Engineered Corn. 

The Examiner's argument appears to be based upon the assumption that the prior art 

teaches that the specified genes would function predictably in corn. While this standard 

overlooks the requirement for a motivation (discussed below), Appellants submit that there is 

no evidence of record that the prior art teaches the required predictability. The Action appears 

to suggest that the requirement for predictability is satisfied by Appellants' specification 

(Advisory Action, page 4, lines 75-79; "... previously known and isolated genes ... which have 

been shown to function transgenically, once inserted into maize ... are made obvious ..."). This 

would again constitute impermissible hindsight reconstruction. Absent an actual admission, 

nothing in appellants' specification can be considered pertinent to a prior art rejection. In re 

Ruff, 118 U.S.P.Q. 340 (CCPA 1958). The issue is whether the prior art teaches that the 

claimed genetic elements would function predictably when genetically engineered into corn - 

NOT whether the claimed elements do indeed function predictably in corn. No such teaching 

has been described by the Examiner. 
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2. No Motivation For Preparing the Claimed Genetically Engineered Corn 
Species Has Been Shown 

The Examiner's reliance on the predictability issues overlooks the second part of the 
OTarrell test; a motivation must be shown in the prior art for genetically engineering corn with 
the specific recited genetic elements. 

In order to find the appealed claims obvious, the Examiner must demonstrate that the 
prior art provides a motivation for one of skill in the art to introduce the specified genes into 
corn. The claimed combination of elements in each of the claims ~ which specify particular 
foreign genes introduced into the corn genome ~ is a novel combination. As such, it is 
incumbent upon the Examiner to identify individual teaching(s) that would suggest the 
desirability of making each of the claimed combinations. As recently pointed out by the Federal 
Circuit, in assessing the patentability of a novel combination of otherwise old elements, "[t]he 
critical inquiry is whether there is something in the prior art as a whole to suggest the 
desirability, and thus the obviousness, of making the combination." In re Newell, 13 
U.S.P.Q.2d 1248, 1250 (Fed. Cir. 1989). 

The requirement that examiner's identify such a motivation is a longstanding patent law 
doctrine, even in biological cases. In the well-known case of In re Bergel and Stock, the CCPA 
stated it thusly: 

The mere fact that it is possible to find two isolated disclosures that might be combined 
in such a way to produce a new compound does not necessarily render such production 
obvious unless the art also contains something to suggest the desirability of the proposed 
combination. 

130 U.S.P.Q. 206 (CCPA 1961). 
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Motivation to combine elements can not be inferred from prior art that discloses one of 

the elements of the combination. This is the clear meaning of the relevant case law, such as In 

re Gordon, wherein the Federal Circuit stated: 

The mere fact that the prior art could be so modified would not have made the 
modification obvious unless the prior art suggested the desirability of the modification. 

221 U.S.P.Q. 1125 (Fed. Cir. 1984). 

This doctrine has been routinely embraced by the PTO Board of Appeals. For example, 
in Ex parte Kranz, the Board made it clear that examiner's must identify either an explicit 
motivation, or a "compelling motivation based upon sound scientific principles. " 19 U.S.P.Q.2d 
1216, 1218 (BPAI 1990). 

Here, the Examiner has taken the position that since the genes are known, and some have 
been placed into a particular plant species other than corn or previously expressed in a 
bacterium, that the introduction of the gene into corn in somehow inherently motivated. It is 
indeed Appellants' specification that provides for the first time the motivation for introducing 
the recited genes into corn -- and for this reason it is improper hindsight for the Examiner to 
rely upon Appellants' own specification to provide the necessary motivation. Deminski. This 
is certainly not the "compelling motivation" based upon "sound scientific principles" referred 
to by the Kranz Board, and certainly an insufficient basis for a prima facie case of obviousness. 
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B. REJECTION OF CLAIMS AS OBVIOUS OVER THE '045 APPLICATION 
SUMMARY OF THE REJECTION 

The Examiner next rejects the claims as obvious over the '045 specification. The basis 
for the obviousness rejection is essentially as set forth above with respect to the obviousness-type 
double patenting rejection, with the exception that the specification of the '045 application can 
be considered. Appellants respond initially by incorporating herein by reference the arguments 
set forth in the obviousness-type double patenting rejection. 

Applicants would further address two specific points raised by the Examiner in the final 
rejection, and the Advisory Action. 

SECTION 103 AND DEUEL MANDATE THAT THE OBVIOUSNESS OF A 
METHOD OF MAKING AN INVENTION IS IRRELEVANT TO WHETHER THE 
INVENTION ITSELF IS OBVIOUS 

In the final Action (at page 6, lines 7-10), the Examiner states that since the various 
genes set forth in the claims are admittedly known in the art, there is nothing nonobvious about 
the claimed invention. 

In response to this comment, Appellants reiterate that they are not claiming the various 
genes per se, but are claiming transgenic plants. In order to make out a prima facie rejection, 
the Examiner is bound to present prior art demonstrating that the claimed transgenic corn plants 
bearing the specified gene in their genome are obvious. 

This is a case that falls under the ambit of the recent Federal Circuit case of In re Deuel, 
34 U.S.P.Q.2d 1211 (Fed. Cir. 1995). The facts of Deuel are on all fours with those here: In 
Deuel, the applicant had cloned a particular gene encoding a known protein. The examiner 
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rejected the claim on the basis that the protein was known, and the methodology for cloning the 

gene from the known protein was routine in the art. The Federal Circuit reversed the examiner 

and the Board of Appeals, holding that the gene is not obvious simply because its encoded 

protein and the general cloning method were known. The court concluded that just because a 

method of preparing the cloned gene was known was irrelevant to the issue of obviousness. In 

the context of a claimed structure, it is imperative that the PTO demonstrate that the claimed 

structure is obvious, not that an obvious method for making the structure was known: 

The PTO's focus on known methods for potentially isolating the claimed DNA 
molecules is also misplaced because the claims at issue define compounds, not methods. 
See In re Bell, 991 F.2d 781, 785, 26 U.S.P.Q.2d 1529, 1532 . . . 

We today reaffirm the principle, stated in Bell, that the existence of a general 
method of isolating cDNA or DNA molecules is essentially irrelevant to the question 
whether the specific molecules themselves would have been obvious, in the absence of 
other prior art that suggests the claimed DNAs. 

34 U.S.P.Q.2d at 1215. 

The court also dismissed out of hand the concept argued by the PTO that since the 
underlying protein was known to have utility there existed a general motivation to clone the 
claimed gene: 

The PTO's theory that one might have been motivated to try to do what Deuel in 
fact accomplished amounts to speculation and an impermissible hindsight reconstruction 
of the claimed invention . . . any motivation that existed [in the prior art] was a general 
one, to try to obtain a gene that was yet undefined and may have constituted many forms. 

The court concluded that a general motivation to search for the gene of a known and 
characterized protein does not make prima facie obvious a gene that is subsequently obtained as 
a result of that search. 
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The legal issue decided in Deuel is not unlike the issue here: Whether the admittedly 
novel transgenic corn plants of the present invention are obvious over different transgenic 
corn plants of the '045 application, merely because the '045 application teaches techniques 
for obtaining transgenic corn plants in general. Both Section 103 and Deuel mandate a 
conclusion of nonobviousness. Section 103 states that "patentability shall not be negatived 
by the manner in which the invention was made." The fact that a general technique exists 
for introducing a particular gene into corn is not relevant to the issue of obviousness under 
Deuel. Deuel and the earlier case of Bell reaffirm this principle, and each hold that 
reference must not be made to the obviousness of the method employed to make an 
invention. There is no teaching in the '045 specification as to the presently claimed plants, 
and admittedly no teaching in the '045 application with respect to the genes employed in the 
present invention. 

In the Advisory Action, the Examiner appeared to agree with the foregoing 
proposition (Advisory Action, page 4, lines 74-75), yet attempts to draw a distinction by 
stating that the specific genes introduced by Appellants are otherwise known and have been 
"isolated and expressed. n This is not a distinction, and is no different from the fact that in 
Deuel the protein encoded by the claimed gene was known and had been characterized. The 
Deuel court held that the protein was not relevant to the patentability of its corresponding 
gene, even though a technique was known for isolating the gene using the known 
characteristics of the protein. Id. Here, the fact that the previously known gene may have 
been isolated and expressed in a bacterium or even in another plant does not render obvious 
a transgenic corn plant that has been genetically engineered to express the gene. As 
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discussed above in responding to the obviousness-type double patenting rejection, it must be 
shown that a motivation existed in the prior art for making the claimed invention, and that it 
was predictable that the genetic element would function in a predictable manner in corn. 
O Tarrell and Vaeck. 

THE FACT THAT STARTING MATERIALS WERE KNOWN IS NOT RELEVANT 
TO PATENTABILITY - THE ART DOES NOT TEACH OR SUGGEST THE 
PARTICULAR CLAIMED SPECIES 

The Advisory Action indirectly cites to teachings said to be contained in "references 
cited in [Appellants'] specification. * (Advisory Action, page 4, lines 81-82). Appellants 
would again caution that while prior art references can be considered for what they teach, 
reliance upon Appellants' specification by the Examiner would constitute impermissible 
hindsight. Moreover, the Examiner has not listed in the rejection the specific references that 
are being relied upon. 

If it is considered that the references and/or teachings incorporated into Appellants' 
specification are properly a part of this rejection, then it is submitted that in order to make 
out a prima facie rejection, the Examiner should consider each such teaching individually and 
it should be explained on the record by the Examiner how the reference teaches or suggests 
the subject matter of the claim. Ex parte Goeddel, 5 U.S.P.Q.2d 1449 (BPAI 1985). This 
has not been accomplished, and no prima facie case has been made. 

Appellants will now review the subject matter of the individual claims that are to be 
considered separately for the purposes of the § 103 rejections, and further review the 
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references set forth in Appellants 1 specification and apparently relied upon by the Examiner 
to support this rejection. 

Claim 47 - Selectable or Screenable Marker Genes 

Independent claim 47 is a Markush claim directed to a transgenic corn plant having 
one or more of two different selectable marker genes inserted in its genome, selected from: 

an aequorin gene; and 

a gene encoding a cell wall protein (e.g. , an HPRG gene (claim 57)). 

The Action fails to make out a prima facie case of obviousness with respect to any 
one of the foregoing transgenic corn species. No motivation for engineering these genetic 
elements into corn has been shown, and no expectation has been shown that they would 
function in a desired fashion even if successfully engineered into corn. 

References relating to the foregoing starting materials are set forth in Exhibit D. Art 
exemplary of the aequorin gene starting material used in the preparation of the corn plants of 
claim 47 is reported in Knight et al. } which discusses the introduction and expression of the 
aequorin gene in transgenic tobacco. The Steifel et ah reference studies the expression of 
the maize HRGP gene in non-transgenic plants , and reports the cloning of the gene in an 2L 
coli host (using an EMBL3 vector). 

It is submitted that these references fail to render the claimed species prima facie 
obvious, as none meet the requirements of O'Farrell, Vaeck, Bell and Deuel. In particular, 
none of these references teach or suggest the introduction of the specified genetic element 
into corn (no motivation), and/or none evidence that the gene once introduced into corn 
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would successfully function to provide a benefit or solve a problem once introduced into 
corn. 

Claim 60 -- Negatively Selectable Marker Genes 

Independent claim 60 is a Markush claim directed to a transgenic corn plant 

comprising one or more of the following different genes in order to provide a negatively 

selectable marker in the resultant genetically engineered corn: 

a cytosine deaminase gene; 
a T-DNA gene 2; 
an antisense bar gene; and 
an antisense nptn gene. 

It is again submitted that the Action fails to make out a prima facie case of 
obviousness with respect to any one of the foregoing transgenic corn species. No motivation 
for engineering these genetic elements into corn has been shown, and no expectation has 
been shown that they would function in a desired fashion even if successfully engineered into 
corn. 

References relating to the foregoing starting materials are set forth in Exhibit E. 
These three references, Stougaard et al., Depicker et al. and Xiang et al., individually relate 
to the expression of three of the foregoing genes in tobacco . It is submitted that the 
introduction and expression of these genes— the cytosine deaminase gene, the T-DNA gene 2 
and the anti-nptn gene— in corn is novel and nonobvious. 

The introduction and expression of the sense version of bar gene in corn has been 
described in the art of record (see, e.g., Ref. C76, Gordon-Kamm et ah, The Plant Cell). 
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However, it is submitted that the purposeful expression of the antisense bar gene is not 
described in the prior art. 

It is submitted that these teachings fail to render the claimed species prima facie 
obvious, as none meet the requirements of O'Farrell, Vaeck, Bell and Deuel. In particular, 
none of these references teach or suggest the introduction of the specified genetic element 
into corn (no motivation), and/or none teach or suggest that the gene once introduced into 
corn would function to provide a benefit and/or solve a problem. 

Claim 61 — Inducible or Tissue Specific Promoter or Enhancers 

Independent claim 61 is a Markush claim directed to a transgenic corn plant 

comprising one or more of following different genetic control elements in order to provide an 

inducible or tissue specific promoter or enhancer capability to the resultant genetically 

engineered corn: 

an a-tubulin promoter; 

an ocs promoter; 

an ABA-inducible promoter; and 

a turgor-inducible promoter. 

The Action fails to make out a prima facie case of obviousness with respect to any 
one of the foregoing transgenic corn species. No motivation for engineering these genetic 
elements into corn has been shown, and no expectation has been shown that they would 
function in a desired fashion even if successfully engineered into corn. 

References related to the foregoing starting materials are set forth in Exhibit F. 
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Guerrero et al. studies the expression of turgor-inducible genes in pea shoots, through 
cloning of the relevant genes in E. coli. Montoliu et al. similarly studies the expression of 
a-tubulin genes in nontransgenic corn, using clones of genes prepared in E. coli. Mundy et 
al. relates to the ABA-inducible promoter of rice. Kononowicz et al. relates to the ocs 
promoter and its use in transgenic tobacco plants. 

It is submitted that these references fail to render the claimed species prima facie 
obvious, as none meet the requirements of O'Farrell, Vaeck, Bell and Deuel. In particular, 
none of these references teach or suggest the introduction of the specified genetic element 
into corn (no motivation), and/or none teach or suggest that the gene once introduced into 
corn would function to provide a benefit or solve a problem. 

Claim 62 -- Herbicide R esistance Genes 

Independent claim 62 is directed to a transgenic corn plant comprising the bxn gene in 
order to provide herbicide resistance characteristics to the resultant genetically engineered 
corn. The Action fails to make out a prima facie case of obviousness with respect to 
transgenic corn bearing the bxn gene. No motivation for engineering the bxn gene into corn 
has been shown, and no expectation has been shown that it would function in a desired 
fashion even if successfully engineered into corn. 

The Stalker et al. reference of Exhibit G, is believed to be exemplary of the 
preparation of the starting material bxn gene. Stalker et al. relates to the cloning and 
expression of the soil bacterium bxn gene in transgenic tobacco plants. It is submitted that 
this reference fails to render the claimed transgenic corn species prima facie obvious, as none 
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meet the requirements of O'Farrett, Vaeck, Bell and Deuel. In particular, references such as 
Stalker et al. fail to teach or suggest the introduction of the bxn gene into corn (no 
motivation), and such fail to evidence that the gene once introduced into corn would function 
to provide a benefit or solve a problem. 

Claim 63 - Insect Resistance Genes 

Independent claim 63 is a Markush claim directed to a transgenic corn plant having 

one or more of following different genes in order to provide insect resistance characteristics 

to the resultant genetically engineered corn: 

an oryzacystatin gene; 

a wheat or barley amylase inhibitor gene; 

a lipoxygenase gene; 

an ecdysteroid UDP-glucosyl transferase gene; and 
a DIMBOA synthetic gene of the bx locus. 

The Action again fails to make out a prima facie case of obviousness with respect to 
any one of the foregoing transgenic corn species. No motivation for engineering these 
genetic elements into corn has been shown, and no expectation has been shown that they 
would function in a desired fashion even if successfully engineered into corn. 

References related to the foregoing starting materials are set forth in Exhibit H. Abe 
et al. concerns the cloning of an oryzacystatin gene in E. coli . Mundy et al. similarly relates 
to the cloning of the a-amylase gene in E. coli. Yenofsky et al. also relates to cloning in E. 
coliy here the cloning of the lipoxygenase gene. O'Reilly et al. concerns the UDP-glycosyl 
transferase gene of an insect baculovirus. Dunn et al. recognizes the existence of the 
DIMBOA gene in non-transgenic corn. 
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It is submitted that these references fail to render the claimed species prima facie 
obvious, as none meet the requirements of O Tamil, Vaeck, Bell and Deuel In particular, 
none of these references teach or suggest the introduction of the specified genetic element 
into corn (no motivation), and none teach or suggest that the gene once introduced into corn 
would function to provide a benefit or solve a problem. 

Claim 64 — Disease Resistance Genes 

Independent claim 64 is directed to a transgenic corn plant having a pathogenesis 
related (PR) protein gene in order to provide disease resistance to the resultant genetically 
engineered corn. 

The Action again fails to make out a prima facie case of obviousness with respect to 
the foregoing transgenic corn species. No motivation for engineering a PR protein gene into 
corn has been shown, and no expectation has been shown that it would function in a desired 
fashion even if successfully engineered into corn. Attached as Exhibit I is the article of Bol 
et al., which reports the existence of PR proteins, but does not suggest the preparation of 
transgenic corn bearing such genes. The Bol et al. reference fails to render the claimed 
species prima facie obvious, as it fails to meet the requirements of O Tamil 9 Vaeck, Bell 
and Deuel. In particular, it fails to teach the introduction of the specified genetic element 
into corn (no motivation), and fails to teach or suggest that the gene once introduced into 
corn would function to provide a benefit or solve a problem. 
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Claim 65 - Stress Resistance Genes 

Independent claim 65 is a Markush claim directed to a transgenic corn plant having 

one or more of following different genes in order to provide stress resistance characteristics 

to the resultant genetically engineered corn: 

a glycerol-3-phosphate acetyltransferase gene; 
a superoxide dismutase gene; and 
a glutathione reductase gene. 

The Action again fails to make out a prima facie case of obviousness with respect to 
any one of the foregoing transgenic corn species. No motivation for engineering these 
genetic elements into corn has been shown, and no expectation has been shown that they 
would function in a desired fashion even if successfully engineered into corn. 

Attached as Exhibit J are various references concerning the foregoing genes, including 
Gupta et aL, Wolter et al., Smith et al., Malan et al. and Bowler et al. It is submitted that 
these references merely concern the existence of these genes, and none teach or suggest 
transgenic corn bearing the specified genes. As such, it is submitted that these references 
fail to render the claimed species prima facie obvious, as none meet the requirements of 
O'Farrett, Vaeck, Bell and Deuel. In particular, none of these references teach or suggest 
the introduction of the specified genetic element into corn (no motivation), and none teach or 
suggest that the gene once introduced into corn would function to provide a benefit or solve a 
problem. 
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Claim 66 - Drought Resistance Genes 

Independent claim 66 is a Markush claim directed to a transgenic corn plant having 

one or more of following different genes in order to provide drought resistance 

characteristics to the resultant genetically engineered corn: 

a mannitoM -phosphate dehydrogenase gene; 
a trehalose-6-phosphate synthase gene; 
a myoinositol O-methyltransferase gene; and 
a Late Embryogenic Protein (LEA) gene. 

The Action again fails to make out a prima facie case of obviousness with respect to 
any one of the foregoing transgenic corn species. No motivation for engineering these 
genetic elements into corn has been shown, and no expectation has been shown that they 
would function in a desired fashion even if successfully engineered into corn. 

Attached as Exhibit K are various references concerning the foregoing genes, 
including Tarczynski et al., Kaasen et al., Vernon et aL, and Dure et al. It is submitted that 
these references merely concern the existence of these genes, and none teach or suggest 
transgenic corn bearing the specified genes. As such, it is submitted that these references 
fail to render the claimed species prima facie obvious, as none meet the requirements of 
O'Farrell, Vaeck, Bell and Deuel. In particular, none of these references teach or suggest 
the introduction of the specified genetic element into corn (no motivation), and none teach or 
suggest that the gene once introduced into corn would function to provide a benefit or solve a 
problem. 
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Claim 67 — Grain Composition Genes 

Independent claim 67 is a Markush claim directed to a transgenic corn plant having 

one or more of following different genes in order to provide certain grain composition 

characteristics to the resultant genetically engineered corn: 

an acetyl-CoA carboxylase gene; 

an ACP-acyltransferase gene; 

a b-ketoacyl-ACP synthase gene; 

an acyl carrier protein gene; 

a fatty acid desaturase gene; 

a fatty acid epoxidase gene; 

a fatty acid hydratase gene; 

a fatty acid dehydratase gene; 

a sense or anti sense phytoene synthase gene; 

a sense or antisense phytoene desaturase gene; 

a sense or antisense lycopene synthase gene; 

a phytase gene; 

an ADP-glucose pyrophosphorylase gene; 
a starch synthase gene; 
a starch branching enzyme gene; and 
a sucrose synthase gene. 

The Action again fails to make out a prima facie case of obviousness with respect to 
any one of the foregoing transgenic corn species. No motivation for engineering these 
genetic elements into corn has been shown, and no expectation has been shown that they 
would function in a desired fashion even if successfully engineered into corn. Attached as 
Exhibit L are references concerning various of the foregoing genes. It is submitted that these 
references merely concern the existence of these genes, and none teach or suggest transgenic 
corn bearing the specified genes. As such, it is submitted that these references fail to render 
the claimed species prima facie obvious, as none meet the requirements of O'Farrell, Vaeck, 
Bell and Deuel. In particular, none of these references teach or suggest the introduction of 
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the specified genetic element into corn (no motivation), and none teach or suggest that the 
gene once introduced into corn would function to provide a benefit or solve a problem. 

THE EXAMINER AGREES THAT TRANSGENIC CORN PLANTS BEARING 
GENES NOT PREVIOUSLY EXPRESSED IN PLANTS ARE NON-OBVIOUS, 
THUS MOOTING THE OBVIOUSNESS ISSUE WITH RESPECT TO CLAIMS 
DIRECTED TO THESE SPECIES 

In the Advisory Action, the Examiner appears to agree that claims directed to 
transgenic corn plants bearing genes NOT previously expressed in plants are patentable over 
the art. For example, at page 4, lines 86-89, it stated that "Applicants are correct that a 
gene which has no known function in a plant and which has not been demonstrated in the art 
to affect a plants phenotype, such as insect resistance or increase in storage proteins, would 
not provide— by its mere existence— motivation to place it in a plant ..." 

Appellants agree with the foregoing proposition, but not with its implication. It is 
Appellants' position that the mere fact that a gene has been expressed in a plant other than 
corn, is not sufficient to render obvious a claim to transgenic corn expressing the specified 
gene for the reasons discussed above. Thus, Applicants respectfully disagree with the 
Examiner's implication that nonobviousness can only lie in the introduction of those gene 
species not previously expressed in another plant. 

In light of the foregoing, it is respectfully submitted that the Examiner has not made 
out a prima facie rejection of the claims on the basis of obviousness over the '045 
specification. 
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C. REJECTION OF CLAIMS OVER THE GOLDMAN '073 PATENT 
SUMMARY OF REJECTION 

Lastly, all of the claims have been rejected over the Goldman '073 patent in view of 
what the Examiner states is "Applicants acknowledged state of the prior art," presumably 
referring to the various references set forth in Appellants' specification that teach genetic 
starting materials employed in the preparation of the claimed transgenic corn plants. The 
Action takes the position that Goldman is enabling for the preparation of fertile, transgenic 
corn, and that Appellants' specification admits that the genes employed in the preparation of 
the claimed plants are known. 

Appellants would first advise this Board that the appeal in related case USSN 
07/565,844 (the '844 case), referred to in section IV. above, involves the issue of whether 
the '073 patent is enabling for the preparation of transgenic corn. In that related appeal, 
many arguments are presented to demonstrate that the '073 patent is not enabling for the 
preparation of fertile, transgenic corn. Appellants must reiterate here its position that 
Goldman is in no way enabling for the preparation of fertile, transgenic corn. However, 
(App ellants do not contest in the present appeal the fact that general techniques were known 
for producing transgenic corn plants prior to the making of the present invention. Since corn 
transformation techniques were known at the time the present invention was made, the issue 
of whether the '073 patent is enabling is not ultimately relevant to the patentability of the 
present invention) To insure against any acquiescence, Appellants incorporate by reference 
the arguments set forth in the '844 appeal, and have attached copies of relevant pages of that 
appeal brief as Exhibit M hereto. 
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In addition to the foregoing, to avoid duplication Appellants apply here the arguments 
set forth above in connection with the rejections over the '045 application. It is pointed out 
that the Goldman patent admittedly fails to in any way teach or suggest the particular 
transgenic corn species employed in the present invention, and thus is no more relevant in 
this regard than is the '045 specification addressed above. Because of the significant 
questions regarding the operability of Goldman, it is in fact a much less relevant reference. 

IX. SUMMARY AND CONCLUSION 

In light of the foregoing comments, appellants submit that the appealed claims meet the 
requirements for patentability. Therefore, appellants respectfully request that the Board reverse 
each of the rejections. 7^\M/7 




David L. Parker 
Reg. No. 32,165 



ARNOLD, WHITE & DURKEE 
P.O. Box 4433 
Houston, Texas 77210 
(512) 418-3000 



Attorney for Appellants 



Date: October 18, 1995 
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CLAIMS ON APPEAL 



2. Cells ; obtained from the plant of any one of claims 47 or 60-67, wherein said cells 
comprise the introduced gene. 

^ m -i* 0 ^ P lant of ^ of claims 47 or 60-67, wherein said progeny 
comprise the introduced gene. ys^*j 

4. Seeds obtained from the plant of claim 3, wherein said seeds comprise the introduced 

inn^ 4 J" V* ™\^ Sgenic maize plant, the genome of which has been augmented by the 
mtroducuon of a DNA composition comprising a selectable or screenable marker gene selected 

^1 ^I I0U 1 ° f gen6S ? nsiSting of 3,1 "W*** * ene ™ d a 8 ene enc oding a cell wall protein 
T mC ^ e ? iWtS ° ne ° r more P henot yP ic characteristics that renderU 
identifiable over the corresponding untransformed maize plant which does not comprise said 
gene, and wherein said gene is transmittable through normal sexual reproduction of the 
transgenic maize plant to subsequent generation plants. 

50. The transgenic maize plant of claim 48, wherein the autonomous replication vector 
comprises a transposable element. 

cnntmilf'o ^ of claim 47 > wherein the gene is positioned under the 

control of a promoter region comprising multiple copies of the 16 bp ocs enhancer element. 

_ 52 ' , The ^sgenic maize plant of claim 47, wherein the selectable or screenable marker 
gene comprises non-expressed DNA. er 

53. The transgenic maize plant of claim 47, wherein the selectable or screenable marker 
gene comprises a dalapon dehalogenase (deh) gene. 

«n, JL 7116 ^^c." 13 ^ P lan t of claim 47, wherein the selectable or screenable marker 
gene comprises an anthramlate synthase gene that confers resistance to 5 methyl tryptophan. 

«», JL™ 6 tm ^ ak . maize P lant of claim 47, wherein the selectable or screenable marker 
gene comprises an aequonn gene. 

a „ n 56 ' . The ^sgenic maize plant of claim 47, wherein the selectable or screenable marker 

gene comprises a gene encoding a cell wall protein. 

57. The transgenic maize plant of claim 56, wherein the selectable or screenable marker 
gene comprises a gene encoding an HPRG. 
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58. The transgenic maize plant of claim 47, wherein the coding sequence of the gene 
is modified to improve expression in maize. 

60. A fertile, transgenic maize plant, the genome of which has been augmented by the 
introduction of a DNA composition comprising a negatively-selectable marker selected from the 
group of genes consisting of a cytosine deaminase gene; a T-DNA gene 2; an antisense bar 
gene; and an antisense nptU gene, so that the transgenic plant exhibits one or more 
characteristics that render it identifiable over the corresponding untransformed maize plant which 
does not comprise said marker, and wherein said marker is transmittable through normal sexual 
reproduction of the transgenic maize plant to subsequent generation plants. 

61. A fertile, transgenic maize plant, the genome of which has been augmented by the 
introduction of a DNA composition comprising an exogenous gene encoding a selected trait the 
gene positioned under the control of an inducible or tissue-specific promoter or enhancer 
comprising an a-tubulin promoter, an ocs promoter, an ABA-inducible promoter, or a turgor- 
inducible promoter, so that the transgenic plant exhibits one or more phenotypic characteristics 
that render it identifiable over the corresponding untransformed maize plant which does not 
comprise said gene, and wherein said gene is transmittable through normal sexual reproduction 
of the transgenic maize plant to subsequent generation plants. 

62. A fertile, transgenic maize plant, the genome of which has been augmented by the 
introduction of a DNA composition comprising a gene encoding a herbicide resistance trait 
comprising a bxn gene, so that the transgenic plant exhibits one or more phenotypic 
characteristics that render it identifiable over the corresponding untransformed maize plant which 
does not comprise said gene, and wherein said gene is transmittable through normal sexual 
reproduction of the transgenic maize plant to subsequent generation plants. 

63. A fertile, transgenic maize plant, the genome of which has been augmented by the 
mtroduction of a DNA composition comprising a gene encoding an insect resistance trait selected 
from the group of genes consisting of an oryzacystatin gene; a wheat or barley amylase inhibitor 
gene; a lipoxygenase gene; an ecdysteroid UDP-glucosyl transferase gene; and a DIMBOA 
synthetic gene of the bx locus, so that the transgenic plant exhibits one or more phenotypic 
characteristics that render it identifiable over the corresponding untransformed maize plant which 
does not comprise said gene, and wherein said gene is transmittable through normal sexual 
reproduction of the transgenic maize plant to subsequent generation plants. 

64. A fertile, transgenic maize plant, the genome of which has been augmented by the 
introduction of a DNA composition comprising a gene encoding a pathogenesis related (PR) 
protein gene, so that the transgenic plant exhibits one or more phenotypic characteristics that 
render it identifiable over the corresponding untransformed maize plant which does not comprise 
said gene, and wherein said gene is transmittable through normal sexual reproduction of the 
transgenic maize plant to subsequent generation plants. 
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65. A fertile, transgenic maize plant, the genome of which has been augmented by the 
introduction of a DNA composition comprising a gene encoding a stress resistance trait selected 
from the group of genes consisting of a glycerol-3-phosphate acetyltransferase gene; a superoxide 
dismutase gene; and a glutathione reductase gene, so that the transgenic plant exhibits one or 
more phenotypic characteristics that render it identifiable over the corresponding untransformed 
maize plant which does not comprise said gene, and wherein said gene is transmittable through 
normal sexual reproduction of the transgenic maize plant to subsequent generation plants. 

66. A fertile, transgenic maize plant, the genome of which has been augmented by the 
introduction of a DNA composition comprising a gene encoding a drought resistance trait 
selected from the group of genes consisting of a mannitol-1 -phosphate dehydrogenase gene; a 
trehalose-6-phosphate synthase gene; a myoinositol O-methyltransferase gene; and a Late 
Embryogenic Protein (LEA) gene, so that the transgenic plant exhibits one or more phenotypic 
characteristics that render it identifiable over the corresponding untransformed maize plant which 
does not comprise said gene, and wherein said gene is transmittable through normal sexual 
reproduction of the transgenic maize plant to subsequent generation plants. 

67. A fertile, transgenic maize plant, the genome of which has been augmented by the 
introduction of a DNA composition comprising a gene encoding a grain composition trait 
selected from the group of genes consisting of an acetyl-CoA carboxylase gene; an ACP- 
acyltransferase gene; a 0-ketoacyl-ACP synthase gene; an acyl carrier protein gene; a fatty acid 
desaturase gene; a fatty acid epoxidase gene; a fatty acid hydratase gene; a fatty acid dehydratase 
gene; a sense or antisense phytoene synthase gene; a sense or antisense phytoene desaturase 
gene; a sense or antisense lycopene synthase gene; a phytase gene; an ADP-glucose 
pyrophosphorylase gene; a starch synthase gene; a starch branching enzyme gene; and a sucrose 
synthase gene, so that the transgenic plant exhibits one or more phenotypic characteristics that 
render it identifiable over the corresponding untransformed maize plant which does not comprise 
said gene, and wherein said gene is transmittable through normal sexual reproduction of the 
transgenic maize plant to subsequent generation plants. 
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FERTILE TRANSGENIC CORN PLANTS ' 



Cross -Reference to Related Application 
This application is a continuation-in-part of U.S. 
5 patent application Serial No. 07/467,983, filed January 22, 
1990. 

Field of the Invention 
This invention relates to fertile transgenic 
10 plants of the species Zea mays (oftentimes referred to 

herein as maize or corn) . The invention further relates to 
producing transgenic plants via particle bombardment and 
subsequent selection techniques which have been found to 
produce fertile transgenic plants. 

15 

Background of the Invention 
Genetic engineering of plants, which entails the 
isolation and manipulation of genetic material (usually in 
the form of DNA or RNA) and the subsequent introduction of 

20 that genetic material into a plant or plant cells, offers 
considerable promise to modern agriculture and plant breed- 
ing. Increased crop food values, higher yields, feed 
value, reduced production costs, pest resistance, stress 
tolerance, drought resistance, the production of pharma- 

25 ceuticals, chemicals and biological molecules as well as 
other beneficial traits are all potentially achievable 
through genetic engineering techniques. Once a gene has 
been identified, cloned, and engineered, it is still neces- 
sary to introduce it into a plant of interest in such a 

30 manner that the resulting plant is both fertile and capable 
of passing the gene on to its progeny. 

A variety of methods have been developed and are 
currently available for the transformation of various 
plants _and plant cells with DNA. Generally, these plants 

35 have been dicotyledonous, and some success has been 
reported with certain of the monocotyledonous cereals. 
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However, some species have heretofore proven untrans form- 
able by any method. Thus, previous to this discovery, no 
technology had been developed which would permit the pro- 
duction of stably transformed Zea mays plants in which the 
transforming DNA is heritable thereof. This failure in the 
art is well documented in the literature and has been 
discussed in a number of recent reviews (Potrykus, 1989; 
Weising et al., 1988; Cocking et al., 1987). 

European Patent Publications 270,356 (McCabe et 
al.) and 275,069 (Arntzen et al.) describe the introduction 
of DNA into maize pollen followed by pollination of maize 
ears and formation of seeds. The plants germinated from 
these seeds are alleged to contain the introduced DNA, but 
there is no suggestion that the introduced DNA was herit- 
able, as has been accomplished in the present invention. 
Only if the DNA introduced into the corn is heritable can 
the corn be used in breeding programs as required for 
successful commercialization of transgenic corn. 

Graves et al. (1986) claim Acrrobacterium-mediated 
transformation of Zea mays seedlings. The evidence was 
based upon assays known to be unreliable. 

Despite extensive efforts to produce fertile 
transformed corn plants which transmit the transforming DNA 
to progeny, there have been no reported successes. Many 
previous failures have been based upon gene transfer to 
maize protoplasts, oftentimes derived from callus, liquid 
suspension culture cells, or other maize cells using a 
variety of transformation techniques. Although several of 
the techniques have resulted in successful transformation 
of corn cells, the resulting cells either could not be 
regenerated into corn plants or the corn plants produced 
were sterile (Rhodes et al. 1988) or, in some cases, it 
even turned out that the plants were, in fact, not trans- 
formed. Thus, while maize protoplasts and some other cells 
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have previously been trans formed, the resulting trans- 
formants could not be regenerated into fertile transgenic 
plants . 

On the other hand, it has been known that at least 
5 certain corn callus can be regenerated to form mature 
plants in a rather straightforward fashion and that the 
resulting plants are often fertile. However, no stable 
transformation of maize callus was ever achieved, i.e., 
there were no techniques developed which would permit a 
10 successful stable transformation of a regenerable callus. 
An example of a maize callus transformation technique which 
has been tried is the use of Aarobacterium -mediated trans- 
fer. 

The art was thus faced with a dilemma. While it 

15 was known that corn protoplast and suspension culture cells 
could be transformed, no techniques were available which 
would regenerate the transformed protoplast into a fertile 
plant. While it was known that corn callus could be regen- 
erated into a fertile plant, there were no techniques known 

20 which could transform the callus, particularly while not 

destroying the ability of the callus both to regenerate and 
to form fertile plants. 

Recently, a new transformation technique has been 
created based upon the bombardment of intact cells and 

25 tissues with DNA-coated micropro jectiles . The technique, 
disclosed in Sanford et al. (1987) as well as in EPO Patent 
Publication 331,855 of J. C. Sanford et al. based upon U.S. 
Serial No. 07/161,807, filed February 29, 1988, has been 
shown effective at producing transient gene expression in 

30 some plant cells and tissues including those from onion, 
maize (Klein et al. 1988a), tobacco, rice, wheat, and 
soybean, and stable expression has been obtained in tobacco 
and soybeans. In fact, stable expression has been obtained 
by bombardment of suspension cultures of Zea mays Black 

35 Mexican Sweet (Klein et al. 1989) which cultures are, 
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however, non-regenerable suspension culture cells, not the 
callus culture cells used in the process of the present 
invention. 

No protocols have been published describing the 
5 introduction of DNA by a bombardment technique into cul- 
tures of regenerable maize cells of any type. No stable 
expression of a gene has been reported by means of bombard- 
ment of corn callus followed by regeneration of fertile 
plants and no regenerable fertile corn has resulted from 

10 DNA-coated microprojectile bombardment of the suspension 
cultures. Thus, the art has failed to produce fertile 
transformed corn plants heretofore. 

A further stumbling block to the successful pro- 
duction of fertile transgenic maize plants has been in 

15 selecting those few trans formants in such a manner that 

neither the regeneration capacity nor the fertility of the 
regenerated trans formant are destroyed. Due to the gener- 
ally low level of trans formants produced by a transforma- 
tion technique, the need for selection of the trans formants 

20 is self-evident. However, selection generally entails the 
use of some toxic agent, e.g., herbicide or antibiotic, 
which may be detrimental to either the regenerability or 
the resultant plant fertility. 

It is thus an object of the present invention to 

25 produce fertile, stably transgenic, Zea mays plants and 

seeds which transmit the introduced gene to progeny. It is 
a further object to produce such stably transgenic plants 
and seeds by a particle bombardment and a selection process 
which results in a high level of viability for at least a 

30 few transformed cells. It is a further object to produce 
fertile stably transgenic plants of other graminaceous 
cereals besides maize. 
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Summary of the Invention 

The present invention relates to fertile trans- 
genic Zea mays plants containing heterologous DNA, prefer- 
ably chromosomally integrated heterologous DNA, which is 
heritable by progeny thereof . 

The invention further relates to all products 
derived from transgenic Zea mays plants , plant cells, plant 
parts , ayid seeds . 

The invention further relates to transgenic Zea 
mays seeds stably containing heterologous DNA and progeny 
which have inherited the heterologous DNA. The invention 
further relates to the breeding of transgenic plants and 
the subsequent incorporation of heterologous DNA into any 
Zea mays plant or line. 

The invention further relates to a process for 
producing fertile transgenic Zea mays plants containing 
heterologous DNA. The process is based upon microprojec- 
tile bombardment, selection, plant regeneration, and coven- 
tional backcrossing techniques. 

The invention further relates to a process for 
producing fertile transformed plants of graminaceous plants 
other than Zea mays which have not been reliably trans- 
formed by traditional methods such as electroporation, 
Agrobacterium , injection, and previous ballistic techni- 
ques . 

The invention further relates to regenerated 
fertile mature maize plants obtained from transformed 
embryogenic tissue, transgenic seeds produced therefrom, 
and Rl and subsequent generations. 



In preferred embodiments, this invention produces 
the fertile transgenic plants by means of a DNA-coated 
micropro jectile bombardment of clumps of friable embryo- 
genie callus , followed by a controlled regimen for selec- 
tion of the transformed callus lines. 

Brief Description of the Drawings 
Figure 1A shows a map of plasmid vector pHYGIl 
utilized in Example I. Figure IB shows the relevant part 
of pHYGIl encompassing the HPT coding sequence and asso- 
ciated regulatory elements. The base pair numbers start 
from the 5' nucleotide in the recognition sequence for the 
indicated restriction enzymes, beginning with the EcoRI 
site at the 5' end of the CaMV 35S promoter. 

Figure 2 shows a map of plasmid vector pBH221 
utilized in Example I, 

Figure 3 is a Southern blot of DNA isolated from 
the PHI callus line and an untrans formed control callus 
line. 

Figure 4 is a Southern blot of leaf DNA isolated 
from Ro plants regenerated from PHI and untrans formed 
callus. 

Figure 5 is a Southern blot of leaf DNA isolated 
from Rl progeny of PHI Ro plants and untrans formed Ro 
plants . 

Figure 6 is a Southern blot of DNA isolated from 
the PH2 callus line and an untrans formed control callus 
line . 

Description of the Preferred Embodiments 
The present invention is directed to the produc- 
tion of fertile transgenic plants and seeds of the species 
Zea mays and to the plants, plant tissues, and seeds 
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derived from such transgenic plants, as well as the subse- 
quent progeny and products derived therefrom. The trans- 
genic plants produced herein include all plants of this 
species, including field corn, popcorn, sweet corn, flint 
5 corn and dent corn. 

"Transgenic" is used herein to include any cell, 
cell line, callus, tissue, plant part or plant, the geno- 
type of which has been altered beneficially by the presence 
of heterologous DNA that was introduced into the genotype 

10 by a process of genetic engineering, or which was initially 
introduced into the genotype of a parent plant by such a 
process and is subsequently transferred to later genera- 
tions by sexual or asexual cell crosses or cell divisions. 
As used herein, "genotype" refers to the sum total of 

15 genetic material within a cell, either chromosomally, or 

extrachromosomally borne. Therefore, the term "transgenic" 
as used herein does not encompass the alteration of the 
genotype of Zea mays by conventional plant breeding methods 
or by naturally occurring events such as random cross- 

20 fertilization or spontaneous mutation. 

By "heritable" is meant that the DNA is capable of 
transmission through a complete sexual cycle of a plant, 
i.e., it is passed from one plant through its gametes to 
its progeny plants in the same manner as occurs in normal 

25 corn. 

The transgenic plants of this invention may be 
produced by (i) establishing a regenerable cell culture, 
preferably a friable embryogenic callus from the plant to 
be transformed, (ii) transforming said cell culture by a 
30 microprojectile bombardment technique, (iii) controllably 
identifying or selecting transformed cells, and (iv) regen- 
erating fertile transgenic plants from the transformed 
cells. Some of the plants of this invention may be pro- 
duced from the transgenic seed produced from the fertile 
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transgenic plants using conventional crossbreeding tech- 
niques to develop transgenic elite lines and varieties, or 
commercial hybrid seed containing heterologous DNA. 

5 i. Plant Lines and Tissue Cultures 

The cells which have been found particularly 
useful to produce the fertile transgenic maize plants 
herein are those callus cells which are regenerable, both 
before and after undergoing a selection regimen as detailed 

10 further below. Generally, these cells will be derived from 
meristematic tissue which contain cells which have not yet 
terminally differentiated. Such tissue in graminaceous 
cereals in general and in maize, in particular, comprise 
tissues found in juvenile leaf basal regions, immature 

15 tassels, immature embryos, and coleoptilar nodes. Prefer- 
ably, immature embryos are used. Methods of preparing and 
maintaining callus from such tissue and plant types are 
well known in the art and details on so doing are available 
in the literature, c.f. Phillips et al. (1988), the disclo- 

20 sure of which is hereby incorporated by reference. 

The specific callus used must be able to regener- 
ate into a fertile plant. The specific regeneration capac- 
ity of particular callus is important to the success of the 
bombardment /selection process used herein because during 

25 and following selection, regeneration capacity may decrease 
significantly. It is therefore important to start with 
cultures that have as high a degree of regeneration capac- 
ity as possible. Callus which is more than about 3 months 
and up to about 36 months of age has been found to have a 

30 sufficiently high level of regenerability and thus is 
preferred. The regenerative capacity of a particular 
culture, may be readily determined by transferring samples 
thereof to regeneration medium and monitoring the formation 
of shoots, roots, and plantlets. The relative number of 

35 plantlets arising per petri dish or per gram fresh weight 
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of tissue may be used as a rough quantitative estimate of 
regeneration capacity. Generally, a culture which will 
produce at least one plant per gram of callus tissue is 
preferred. 

5 While maize callus cultures can be initiated from 

a number of different plant tissues, the cultures useful 
herein are preferably derived from immature maize embryos 
which are removed from the kernels of an ear when the 
embryos are about 1-3 mm in length. This length generally 

10 occurs about 9-14 days after pollination. Under aseptic 
conditions, the embryos are placed on conventional solid 
media with the embryo axis down (scutellum up). Callus 
tissue appears from the scutellum after several days to a 
few weeks. After the callus has grown sufficiently, the 

15 cell proliferations from the scutellum may be evaluated for 
friable consistency and the presence of well-defined 
embryos. By "friable consistency" it is meant that the 
tissue is easily dispersed without causing injury to the 
cells. Tissue with this morphology is then transferred to 

20 fresh media and subcultured on a routine basis about every 
two weeks . 

The callus initiation media is solid because 
callus cannot be readily initiated in liquid medium. In 
preferred embodiments, the initiation/maintenance media is 

25 typically based on the N6 salts of Chu et al. (1975) as 
described in Armstrong et al. (1985) or the MS salts of 
Murashige et al. (1962). The basal medium is supplemented 
with sucrose and 2,4-dichlorophenoxyacetic acid (2,4-D). 
Supplements such as L-proline and casein hydrolysate have 

30 been found to improve the frequency of initiation of callus 
cultures, morphology, and growth. The cultures are gener- 
ally maintained in the dark, though low light levels may 
also be used. The level of synthetic hormone 2,4-D, neces- 
sary for maintenance and propagation, should be generally 

35 about 0.3 to 3.0 mg/1. 
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Although successful transformation and regenera- 
tion has been accomplished herein with friable embryogenic 
callus, this is not meant to imply that other transformable 
regenerable cells, tissue, or organs cannot be employed to 
5 produce the fertile transgenic plants of this invention. 
The only actual requirement for the cells which are trans- 
formed is that after transformation they must be capable of 
regeneration of a plant containing the heterologous DNA 
following the particular selection or screening procedure 
10 actually used. 

II. DNA Used for Transformation 

As used herein, the term "heterologous DNA" refers 
to a DNA segment that has been derived or isolated from one 

15 genotype, preferably amplified and/or chemically altered, 
and later introduced into a Zea mays genotype that may be 
the same Zea mays genotype from which the DNA was first 
isolated or derived. "Heterologous DNA" also includes 
completely synthetic DNA, and DNA derived from introduced 

20 RNA. Generally, the heterologous DNA is not originally 

resident in the Zea mays genotype which is the recipient of 
the DNA, but it is within the scope of the invention to 
isolate a gene from a given Zea mays genotype, and to 
subsequently introduce multiple copies of the gene into the 

25 same genotype, e.g., to enhance production of an amino 
acid. 

Therefore, "heterologous DNA" is used herein to 
include synthetic, semi-synthetic, or biologically derived 
DNA which is introduced into the Zea mays genotype, and 
30 retained by the transformed Zea mays genotype. The DNA 
includes but is not limited to, non-plant genes such as 
those from bacteria, yeasts, animals or viruses; modified 
genes, portions of genes, chimeric genes, as well as genes 
from the same or different Zea mays genotype. 
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The heterologous DNA used for transformation 
herein may be circular or linear, double-stranded or 
single-stranded. Generally, the DNA is in the form of a 
plasmid and contains coding regions of beneficial hetero- 
5 logous DNA with flanking regulatory sequences which promote 
the expression of the heterologous DNA present in the 
resultant corn plant. For example, the heterologous DNA 
may itself comprise or consist of a promoter that is active 
in Zea mays , or may utilize a promoter already present in 

10 the Zea mays genotype that is the transformation target. 

The compositions of and method for constructing 
heterologous DNA which can transform certain plants is well 
known to those skilled in the art, and the same composi- 
tions and methods of construction may be utilized to pro- 

15 duce the heterologous DNA useful herein. The specific 

composition of the DNA is not central to the present inven- 
tion and the invention is not dependent upon the composi- 
tion of the specific transforming DNA used. Weising et al. 
(1988), the subject matter of which is incorporated herein 

20 by reference, describes suitable DNA components, which 
include promoters, polyadenylation sequences, selectable 
marker genes, reporter genes, enhancers, introns, and the 
like, as well as provides suitable references for composi- 
tions therefrom. Sambrook et al. (1989) provides suitable 

25 methods of construction. 

Generally, the heterologous DNA will be relatively 
small, i.e., less than about 30 Kb to minimize any suscep- 
tibility to physical, chemical, or enzymatic degradation 
which is known to increase as the size of the DNA 

30 increases. 

Suitable heterologous DNA for use herein includes 
all DNA which provides for, or enhances, a beneficial 
feature of the resultant transgenic corn plant. The DNA 
may encode proteins or antisense RNA transcripts in order 

35 to promote increased food values, higher yields, pest 
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resistance , disease resistance , and the like. For example, 
the DNA can encode a bacterial dap A for increased lysine 
production; Bacillus thurinaiensis (BT) t-endotoxin or 
protease inhibitor for insect resistance; bacterial ESPS 
5 synthase for resistance to glyphosate herbicide; and chiti- 
nase or glucan endo-l,3-B-glucosidase for fungicidal pro- 
perties. Aside from DNA sequences that serve as transcrip- 
tion units or portions thereof, useful DNA may be untrans- 
cribed, serving a regulatory or a structural function. 

10 Also, the DNA may be introduced to act as a genetic tool to 
generate mutants and/or assist in the identification, 
genetic tagging, or isolation of segments of corn DNA. 
Additional examples may be found in Weising, supra . 

The heterologous DNA to be introduced into the 

15 plant further will generally contain either a selectable 

marker or a reporter gene or both to facilitate identifica- 
tion and selection of transformed cells. Alternatively, 
the selectable marker may be carried on a separate piece of 
DNA and used in a co- transformation procedure. Both selec- 

20 table markers and reporter genes may be flanked with appro- 
priate regulatory sequences to enable expression in plants. 
Useful selectable markers are well known in the art and 
include, for example, antibiotic and herbicide resistance 
genes. Specific examples of such genes are disclosed in 

25 Weising et al., supra. A preferred selectable marker gene 
is the hygromycin B phosphotransferase (HPT) coding 
sequence, which may be derived from E. coli and which 
confers resistance to the antibiotic hygromycin B. Other 
selectable markers include aminoglycoside phosphotrans- 

30 f erase gene of transposon Tn5 (Aphll) which encodes resis- 
tance to the antibiotics kanamycin, neomycin, and G418, as 
well as those genes which code for resistance or tolerance 
to glyphosate, 1,2-dichloropropionic acid methotrexate, 
imidazolinones , sulfonylureas, bromoxynil, phosphonothricin 

35 and the like. Those selectable marker genes which confer 
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herbicide resistance or tolerance are also of commercial 
utility in the resulting transformed plants. 

Reporter genes which encode for easily assayable 
marker proteins are well known in the art. In general, a 
5 reporter gene is a gene which is not present in or 

expressed by the recipient organism or tissue and which 
encodes a protein whose expression is manifested by some 
easily detectable property , e.g., phenotypic change or 
enzymatic activity. Examples of such genes are provided in 

10 Weising et al. r supra . Preferred genes include the chlor- 
amphenicol acetyl transferase gene from Tn9 of E. coli, the 
beta-glucuronidase gene of the uidA locus of E. coli , and 
the lucif erase genes from firefly Photinus pyralis. 

The regulatory sequences useful herein include any 

15 constitutive, inducible, tissue or organ specific, or 
developmental stage specific promoter which can be ex- 
pressed in the particular plant cell. Suitable such pro- 
moters are disclosed in Weising et al., supra . The follow- 
ing is a partial representative list of promoters suitable 

20 for use herein: regulatory sequences from the T-DNA of 
Aarobacterium tumefaciens , including mannopine synthase, 
nopaline synthase, and octopine synthase; alcohol dehydro- 
genase promoter from corn; light inducible promoters such 
as, ribulose-biphosphate-carboxylase small subunit gene 

25 from a variety of species; and the major chlorophyll a/b 
binding protein gene promoter; 35S and 19 S promoters of 
cauliflower mosaic virus; development ally regulated promo- 
ters such as the waxy, zein, or bronze promoters from 
maize; as well as synthetic or other natural promoters 

30 which are either inducible or constitutive, including those 
promoters exhibiting organ-specific expression or expres- 
sion at specific development stage(s) of the plant. 
« 

Other elements such as introns, enhancers, poly- 
adenylation sequences and the like, may also be present on 
35 the DNA. Such elements may or may not be necessary for the 
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function of the DNA, although they can provide a better 
expression or functioning of the DNA by affecting trans- 
cription, stability of the mRNA, or the like. Such 
elements may be included in the DNA as desired to obtain 
5 the optimal performance of the transforming DNA in the 
plant. For example, the maize AdhlS first intron may be 
placed between the promoter and the coding sequence of a 
particular heterologous DNA. This intron, when included in 
a DNA construction, is known to generally increase expres- 

10 sion of a protein in maize cells. (Callis et al. 1987) 
However, sufficient expression for a selectable marker to 
perform satisfactorily can often be obtained without an 
intron. (Klein et al. 1989) An example of an alternative 
suitable intron is the shrunken - 1 first intron of Zea mays. 

15 These other elements must be compatible with the remainder 
of the DNA constructions. 

To determine whether a particular combination of 
DNA and recipient plant cells are suitable for use herein, 
the DNA may include a reporter gene. An assay for expres- 

20 sion of the reporter gene may then be performed at a suit- 
able time after the DNA has been introduced into the recip- 
ient cells. A preferred such assay entails the use of the 
E. coli beta-glucuronidase (GUS) gene (Jefferson et al. 
1987). In the case of the micropro jectile bombardment 

25 transformation process of the present invention, a suitable 
time for conducting the assay is about 2-3 days after 
bombardment. The use of transient assays is particularly 
important when using DNA components which have not pre- 
viously been demonstrated or confirmed as compatible with 

30 the desired recipient cells. 

III. DNA Delivery Process 

The DNA can be introduced into the regenerable 
maize cell cultures , preferably into callus cultures via a 
35 particle bombardment process. A general description of a 
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suitable particle bombardment instrument is provided in 
Sanford et al. (1987), the disclosure of which is incorpor- 
ated herein by reference * While protocols for the use of 
the instrument in the bombardment of maize non-regenerable 
5 suspension culture cells are described in Klein et al. 

(1988a, 1988b, and 1989), no protocols have been published 
for the bombardment of callus cultures or regenerable maize 
cells. 

In a microprojectile bombardment process, also 

10 referred to as a biolistic process, the transport of the 
DNA into the callus is mediated by very small particles of 
a biologically inert material. When the inert particles 
are coated with DNA and accelerated to a suitable velocity, 
one or more of the particles is able to enter into one or 

15 more of the cells where the DNA is released from the par- 
ticle and expressed within the cell. While some of the 
cells are fatally damaged by the bombardment process, some 
of the recipient cells do survive, stably retain the intro- 
duced DNA, and express it. 

20 The particles, called microprojectiles, are gener- 

ally of a high density material such as tungsten or gold. 
They are coated with the DNA of interest. The microprojec- 
tiles are then placed onto the surface of a macropro jectile 
which serves to transfer the motive force from a suitable 

25 energy source to the microprojectiles. After the macro- 
projectile and the microprojectiles are accelerated to the 
proper velocity, they contact a blocking device which 
prevents the macropro jectile from continuing its forward 
path but allows the DNA-coated microprojectiles to continue 

30 on and impact the recipient callus cells. Suitable such 
instruments may use a variety of motive forces such as 
gunpowder or shock waves from an electric arc discharge 
(Swain et al. 1988). An instrument in which gunpowder is 
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the motive force is currently preferred and such is des- 
cribed and further explained in Sanford et al. (1987), the 
disclosure of which is incorporated herein by reference. 

A protocol for the use of the gunpowder instrument 
5 is provided in Klein et al. (1988a, b) and involves two 
major steps. First, tungsten microprojectiles are mixed 
with the DNA, calcium chloride, and spermidine free base in 
a specified order in an aqueous solution. The concentra- 
tions of the various components may be varied as taught. 

10 The preferred procedure entails exactly the procedure of 

Klein et al. (1988b) except for doubling the stated optimum 
DNA concentration. Secondly, the DNA-coated microprojec- 
tiles, macroprojec tiles, and recipient cells are placed in 
position in the instrument and the motive force is applied 

15 to the macroprojectiles. Parts of this step which may be 
varied include the distance of the recipient cells from the 
end of the barrel as well as the vacuum in the sample 
chamber. The recipient tissue is positioned 5 cm below the 
stopping plate tray. 

20 The callus cultures useful herein for generation 

of transgenic plants should generally be about midway 
between transfer periods, and thus, past any "lag" phase 
that might be associated with a transfer to a new media, 
but also before reaching any "stationary" phase associated 

25 with a long time on the same plate. The specific tissue 
subjected to the bombardment process is preferably taken 
about 7-10 days after subculture, though this is not 
believed critical. The tissue should generally be used in 
the form of pieces of about 30 to 80, preferably about 40 

30 to 60, mg. The clumps are placed on a petri dish or other 
surface and arranged in essentially any manner, recognizing 
that (i) the space in the center of the dish will receive 
the heaviest concentration of metal-DNA particles and the 
tissue located there is likely to suffer damage during 

35 bombardment and, (ii) the number of particles reaching a 
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cell will decrease (probably exponentially) with increasing 
distance of the cell from the center of the blast so that 
cells far from the center of the dish are not likely to be 
bombarded and transformed. A mesh screen, preferably of 
5 metal, may be laid on the dish to prevent splashing or 

ejection of the tissue. The tissue may be bombarded one or 
more times with the DNA-coated metal particles. 

IV. Selection Process 

10 Once the calli have been bombarded with the DNA 

and the DNA has penetrated some of the cells, it is neces- 
sary to identify and select those cells which both contain 
the heterologous DNA and still retain sufficient regenera- 
tive capacity. There are two general approaches which have 

15 been found useful for accomplishing this. First, the 

transformed calli or plants regenerated therefrom can be 
screened for the presence of the heterologous DNA by 
various standard methods which could include assays for the 
expression of reporter genes or assessment of phenotypic 

20 effects of the heterologous DNA, if any. Alternatively, 
and preferably, when a selectable marker gene has been 
transmitted along with or as part of the heterologous DNA, 
those cells of the callus which have been transformed can 
be identified by the use of a selective agent to detect 

25 expression of the selectable marker gene. 

Selection of the putative trans formants is a 
critical part of the successful transformation process 
since selection conditions must be chosen so as to allow 
growth and accumulation of the transformed cells while 

30 simultaneously inhibiting the growth of the non- trans formed 
cells. The situation is complicated by the fact that the 
vitality of individual cells in a population is often 
highly dependent on the vitality of neighboring cells. 
Also, the selection conditions must not be so severe that 

35 the plant regeneration capacity of the callus cells and the 
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fertility of the resulting plant are precluded. Thus, the 
effects of the selection agent on cell viability and mor- 
phology should be evaluated. This may be accomplished by 
experimentally producing a growth inhibition curve for the 
5 given selective agent and tissue being transformed before- 
hand. This will establish the concentration range which 
will inhibit growth. 

When a selectable marker gene has been used, the 
callus clumps may be either allowed to recover from the 

10 bombardment on non-selective media, or preferably, directly 
transferred to media containing that agent. 

Selection procedures involve exposure to a toxic 
agent and may employ sequential changes in the concentra- 
tion of the agent and multiple rounds of selection. The 

15 particular concentrations and cycle lengths are likely to 
need to be varied for each particular agent. A currently 
preferred selection procedure entails using an initial 
selection round at a relatively low toxic agent concentra- 
tion and then later round(s) at higher concentration s) . 

20 This allows the selective agent to exert its toxic effect 
slowly over a longer period of time. Preferably, the 
concentration of the agent is initially such that about a 
5-40% level of growth inhibition will occur, as determined 
from a growth inhibition curve. The effect may be to allow 

25 the transformed cells to preferentially grow and divide 

while inhibiting untransformed cells, but not to the extent 
that growth of the transformed cells is prevented. Once 
the few individual transformed cells have grown suffici- 
ently, the tissue may be shifted to media containing a 

30 higher concentration of the toxic agent to kill essentially 
all untransformed cells. The shift to the higher concen- 
tration also reduces the possibility of non- trans formed 
cells habituating to the agent. The higher level is pre- 
ferably in the range of about 30 to 100% growth inhibition. 

35 The length of the first selection cycle may be from about 1 
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to 4 weeks, preferably about 2 weeks. Later selection 
cycles may be from about 1 to about 12 weeks, preferably 
about 2 to about 10 weeks. Putative maize trans formants 
can generally be identified as proliferating sectors of 
5 tissue among a background of non-proliferating cells. The 
callus may also be cultured on non-selective media at 
various times during the overall selection procedure. 

Once a callus sector is identified as a putative 
trans formant, transformation can be confirmed by phenotypic 

10 and/or genotypic analysis. If a selection agent is used, 
an example of phenotypic analysis is to measure the 
increase in fresh weight of the putative transformant as 
compared to a control on various levels of the selective 
agent. Other analyses that may be employed will depend on 

15 the function of the heterologous DNA. For example, if an 
enzyme or protein is encoded by the DNA, enzymatic or 
immunological assays specific for the particular enzyme or 
protein may be used. Other gene products may be assayed by 
using a suitable bioassay or chemical assay. Other such 

20 techniques are well known in the art and are not repeated 
here. The presence of the gene can also be confirmed by 
conventional procedures, i.e.. Southern blot or polymerase 
chain reaction (PCR) or the like. 

25 V. Regeneration of Plants and Pr oduction of Seed 

Cell lines which have been shown to be transformed 
must then be regenerated into plants and the fertility of 
the resultant plants determined. Transformed lines which 
test positive by genotypic and/or phenotypic analysis are 

30 then placed on a media which promotes tissue differentia- 
tion and plant regeneration. Regeneration may be carried 
out in accordance with standard procedures well known in 
the art. The procedures commonly entail reducing the level 
of auxin which discontinues proliferation of a callus and 
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promotes somatic embryo development or other tissue differ- 
entiation. One example of such a regeneration procedure is 
described in Green et al. (1982). The plants are grown to 
maturity in a growth room or greenhouse and appropriate 
5 sexual crosses and selfs are made as described by Neuf fer 
(1982). 

Regeneration, while important to the present 
invention, may be performed in any conventional manner. If 
a selectable marker has been transformed into the cells, 

10 the selection agent may be incorporated into the regenera- 
tion media to further confirm that the regenerated plant- 
lets are trans formed. Since regeneration techniques are 
well known and not critical to the present invention, any 
technique which accomplishes the regeneration and produces 

15 fertile plants may be used. 

VI, Analysis of Rl Progeny 

The plants regenerated from the transformed callus 
are referred to as the RO generation or RO plants. The 

20 seeds produced by various sexual crosses of the RO genera- 
tion plants are referred to as Rl progeny or the Rl genera- 
tion. When Rl seeds are germinated, the resulting plants 
are also referred to as the Rl generation. 

To confirm the successful transmission and inher- 

25 itance of the heterologous DNA in the sexual crosses des- 
cribed above, the Rl generation should be analyzed to 
confirm the presence of the transforming DNA. The analysis 
may be performed in any of the maimers such as were dis- 
closed above for analyzing the bombarded callus for evi- 

30 dence of transformation, taking into account the fact that 
plants and plant parts are being used in place of the 
callus . , 
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VII. Establishment of the Heterologous DNA in Other Maize 
Varieties 

Fertile, transgenic plants may then be used in a 
conventional maize breeding program in order to incorporate 
5 the introduced heterologous DNA into the desired lines or 
varieties. Conventional breeding programs employ a conver- 
sion process (backcrossing) . Methods and references for 
convergent improvement of corn are given by Hallauer et 
al., (1988) incorporated herein by reference. Briefly, 

10 conversion is performed by crossing the initial transgenic 
fertile plant to normal elite inbred lines. The progeny 
from this cross will segregate such that some of the plants 
will carry the heterologous DNA whereas some will not. The 
plants that do carry the DNA are then crossed again to the 

15 normal plant resulting in progeny which segregate once 
more. This backcrossing process is repeated until the 
original normal parent has been converted to a line con- 
taining the heterologous DNA and also possessing all other 
important attributes originally found in the parent. 

20 Generally, this will require about 6-8 generations. A 
separate backcrossing program will be generally used for 
every elite line that is to be converted to a genetically 
engineered elite line. 

Generally, the commercial value of the transformed 

25 corn produced herein will be greatest if the heterologous 
DNA can be incorporated into many different hybrid combina- 
tions. A farmer typically grows several varieties of 
hybrids based on differences in maturity, standability, and 
other agronomic traits. Also, the farmer must select a 

30 hybrid based upon his physical location since hybrids 
adapted to one part of the corn belt are generally not 
adapted to another part because of differences in such 
traits as maturity, disease, and insect resistance. As 
such, it is necessary to incorporate the heterologous DNA 

35 into a large number of parental lines so that many hybrid 
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combinations can be produced containing the desirable 
heterologous DNA. 

Corn breeding and the techniques and skills 
required to transfer genes from one line or variety to 
5 another are well known to those skilled in the art. Thus, 
introducing heterologous DNA into other lines or varieties 
can be readily accomplished by these breeding procedures 
whether or not they generate the appropriate calli. 

10 VIII. Uses of Transgenic Plants 

The transgenic plants produced herein are expected 
to be useful for a variety of commercial and research 
purposes. Transgenic plants can be created for use in 
traditional agriculture to possess traits beneficial to the 

15 grower (e.g., agronomic traits such as pest resistance or 
increased yield) , beneficial to the consumer of the grain 
harvested from the plant (e.g., improved nutritive content 
in human food or animal feed) , or beneficial to the food 
processor (e.g., improved processing traits). In such 

20 uses, the plants are generally grown for the use of their 
grain in human or animal foods. However, other parts of 
the plants, including stalks, husks, vegetative parts, and 
the like, may also have utility, including use as part of 
animal silage or for ornamental purposes (e.g., Indian 

25 corn). Often, chemical constituents (e.g., oils or 

starches) of corn and other crops are extracted for foods 
or industrial use and transgenic plants may be created 
which have enhanced or modified levels of such components. 
The plants may also be used for seed production for a 

30 variety of purposes. 

Transgenic plants may also find use in the commer- 
cial manufacture of proteins or other molecules encoded by 
the heterologous DNA contained therein, where the molecule 
of interest is extracted or purified from plant parts, 

35 seeds, and the like. Cells or tissue from the plants may 
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also be cultured, grown in vitro , or fermented to manufac- 
ture such molecules, or for other purposes (e.g., for 
research) . 

The transgenic plants may also be used in commer- 
5 cial breeding programs, or may be crossed or bred to plants 
of related crop species. Improvements encoded by the 
heterologous DNA may be transferred, e.g., from corn cells 
to cell of other species, e.g., by protoplast fusion. 

The transgenic plants may have many uses in 

10 research or breeding, including creation of new mutant 

plants through insertional mutagenesis, in order to iden- 
tify beneficial mutants that might later be created by 
traditional mutation and selection. The methods of the 
invention may also be used to create plants having unique 

15 "signature sequences" or other marker sequences which can 
be used to identify proprietary lines or varieties. 

The following non-limiting examples are illustra- 
tive of the present invention. They are presented to 
better explain the general procedures which were used to 

20 prepare the fertile Zea mays plants of this invention which 
stably express the heterologous DNA and which transmit that 
DNA to progeny. All parts and percents are by weight 
unless otherwise specified. It must be recognized that a 
specific transformation event is a function of the amount 

25 of material subjected to the transformation procedure. 

Thus, when individual situations arise in which the proce- 
dures described herein do not produce a trans formed pro- 
duct, repetition of the procedures will be required. 

30 Example I. 

Fertile transgenic Zea mays plants which contain 
heterologous DNA which is heritable were prepared as 
follows : 
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I. Initiation and maintenance of maize cell cultu res which 
retain plant regeneration capacity 

Friable, embryogenic maize callus cultures were 
initiated from hybrid immature embryos produced by pollina- 
5 tion of inbred line A188 plants (University of Minnesota, 
Crop Improvement Association) with pollen of inbred line 
B73 plants (Iowa State University). Ears were harvested 
when the embryos had reached a length of 1.5 to 2.0 mm. 
The whole ear was surface sterilized in 50% v/y commercial 

10 bleach (2.63% w/v sodium hypochlorite) for 20 min. at room 
temperature. The ears were then washed with sterile, 
distilled, deionized water. Immature embryos were asepti- 
cally isolated and placed on nutrient medium initiation/- 
maintenance media with the root /shoot axis exposed to the 

15 medium. Initiation/maintenance media (hereinafter referred 
to as "F medium") consisted of N6 basal media (Chu 1975) 
with 2% (w/v) sucrose, 1.5 mg per liter 2,4-dichloro- 
phenoxyacetic acid (2,4-D), 6 mM proline, and 0.25% Gelrite 
(Kelco, Inc., San Diego). The pH was adjusted to 5.8 prior 

20 to autoclaving. Unless otherwise stated, all tissue cul- 
ture manipulations were carried out under sterile condi- 
tions. 

o 

The immature embryos were incubated at 26 C in the 
dark. Cell proliferations from the scutellum of the imma- 

25 ture embryos were evaluated for friable consistency and the 
presence of well-defined somatic embryos . Tissue with this 
morphology was transferred to fresh media 10 to 14 days 
after the initial plating of the immature embryos. The 
tissue was then subcultured on a routine basis every 14 to 

30 21 days. Sixty to eighty milligram quantities of tissue 
were removed from pieces of tissue that had reached a size 
of approximately one gram and transferred to fresh media. 
Subculturing always involved careful visual monitoring to 
be sure that only tissue of the correct morphology was 

35 maintained. The presence of somatic embryos ensured that 



26 

the cultures would give rise to plants under the proper 
conditions. The cell culture named AB12 used in this 
example was such a culture and had been initiated about 1 
year before bombardment. 

II. Plasmids - pCHNl-1, pHYGIl, PBII221. and pLUC-1 

The plasmids pCHNl-1, pHYGIl, and pLUC-1 were 
constructed in the vector pBS+ (Stratagene, Inc., San 
Diego, CA) , a 3.2 Kb circular plasmid, using standard 
recombinant DNA techniques. pCHNl-1 contains the hygro- 
mycin B phosphotransferase (HPT) coding sequence from E. 
coli (Gritz et al. 1983) flanked at the 3' end by the 
nopaline synthase (nos) polyadenylation sequence of Agro- 
bacterium tumefaciens (Chilton and Barnes 1983). Expres- 
sion is driven by the cauliflower mosaic virus (CaMV) 35S 
promoter (Guilley et al. 1982), located upstream from the 
hygromycin coding sequence. The plasmid pHYGIl was con- 
structed by inserting the 553 bp Bcl-BamHI fragment con- 
taining the maize AdhlS first intron (Callis et al. 1987) 
between the CaMV 35S promoter and the hygromycin coding 
sequence of pCHNl-1. A map of pHYGIl is provided as Figure 
1. A sample of pHYGIl was deposited at the American Type 
Culture Collection, Rockville, MD, USA, on March 16, 1990, 
under the provisions of the Budapest Treaty, and assigned 
accession number 40774. 

pBII221 contains the E. coli B-glucuronidase 
coding sequence flanked at the 5' end by the CaMV 35S 
promoter and at the 3 ' end by the nos polyadenylation 
sequence. The plasmid was constructed by inserting the 
maize AdhlS first intron between the 35S promoter and the 
coding sequence of pBl221 (Jefferson et al. 1987). A map 
of pBII221 is provided as Figure 2. 

pLUC-1 contains the firefly luciferase coding 
sequence (DeWet et al. 1987) flanked at the 5' end by the 
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CaMV 35S promoter and at the 3 f end by the nos polyadenyla- 
tion sequence . This plasmid was used solely as negative 
control DNA. 

Plasmids were introduced into the embryogenic 
5 callus culture AB12 by micropro jectile bombardment. 

III. DNA delivery process 

The embryogenic maize callus line AB12 was subcul- 
tured 7 to 12 days prior to micropro jectile bombardment, 

10 AB12 was prepared for bombardment as follows. Five clumps 
of callus, each approximately 50 mg in wet weight were 
arranged in a cross pattern in the center of a sterile 60 x 
15 mm petri plate (Falcon 1007). Plates were stored in a 
closed container with moist paper towels , throughout the 

15 bombardment process. Twenty-six plates were prepared. 

Plasmids were coated onto M-10 tungsten particles 
(Biolistics) exactly as described by Klein et al. (1988b) 
except that, (i) twice the recommended quantity of DNA was 
used, (ii) the DNA precipitation onto the particles was 

20 performed at 0°C, and (iii) the tubes containing the DNA- 
coated tungsten particles were stored on ice throughout the 
bombardment process. 

All of the tubes contained 25 /il 50 mg/ml M-10 
tungsten in water, 25 yl 2.5 M CaCl 2 , and 10 jil 100 mM 

25 spermidine free base along with a total of 5 /il 1 mg/ml 

total plasmid content. When two plasmids were used simul- 
taneously, each was present in an amount of 2.5 jil. One 
tube contained only plasmid pBII221; two tubes contained 
both plasmids pHYGIl and pBH221; two tubes contained both 

30 plasmids pCHNl-1 and pBII221; and one tube contained only 
plasmid pLUC-1. 

All tubes were incubated on ice for 10 min., 

pelletized by centrifugation in an Eppendorf centrifuge at 
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room temperature for 5 seconds, and 25 pi of the super- 
natant was discarded. The tubes were stored on ice 
throughout the bombardment process. Each preparation was 
used for no more than 5 bombardments . 

Macroprojectiles and stopping plates were obtained 
from Biolistics, Inc. (Ithaca, NY). They were sterilized 
as described by the supplier. The microprojectile bombard- 
ment instrument was obtained from Biolistics, Inc. 

The sample plate tray was positioned at the posi- 
tion 5 cm below the bottom of the stopping plate tray of 
the microprojectile instrument, with the stopping plate in 
the slot nearest to the barrel. Plates of callus tissue 
prepared as described above were centered on the sample 
plate tray and the petri dish lid removed. A 7 x 7 cm 
square rigid wire mesh with 3 x 3 mm mesh and made of 
galvanized steel was placed over the open dish in order to 
retain the tissue during the bombardment. Tungsten/DNA 
preparation were sonicated as described by Biolistics, Inc. 
and 2.5 /il was pipetted onto the top of the macroprojec- 
tiles. The instrument was operated as described by the 
manufacturer. The bombardments which were performed are 
summarized on Table 1. 

Table 1. 

To determine transient expres- 
sion frequency 

As a potential positive treat- 
ment for transformation 

As a potential positive treat- 
ment for transformation 

Negative control treatment 



2 x pBH221 prep 
10 x pHYGIl/pBH221 
10 x pCHNl-l/pBH221 
4 x pLUC-1 



The two plates of callus bombarded with pBH221 
were transferred plate for plate to F medium (with no 
hygromycin) and the callus cultured at 26 *C in the dark. 
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After 2 days, this callus was then transferred plate for 
plate into 35 x 10 mm petri plates (Falcon 1008) containing 
2 ml of GUS assay buffer which consists of 1 mg/ml 5-bromo- 
4-chloro-3-indolyl-beta-D-glucuronide (Research Organics), 
5 100 mM sodium phosphate pH 7.0, 5 mM each of potassium 
ferricyanide and potassium f errocyanide , 10 mM EDTA, and 
0.06% Triton X-100. These were incubated at 37 °C for 3 
days after which the number of blue cells was counted 
giving 291 and 477 transient GUS expressing cells in the 
10 two plates, suggesting that the DNA delivery process had 
also occurred with the other bombarded plates. These 
plates were discarded after counting since the GUS assay is 
destructive. 

15 IV. Selection process 

Hygromycin B (Calbiochem) was incorporated into 
the medium by addition of the appropriate volume of filter 
sterilized 100 mg/ml hygromycin B in water when the media 
had cooled to 45 °C prior to pouring plates. 

20 Immediately after all samples had been bombarded, 

callus from all of the plates treated with pHYGIl/pBH221, 
pCHNl-l/pBH221 and three of the plates treated with pLUC-1 
were transferred plate for plate onto F medium containing 
15 mg/1 hygromycin B, (five pieces of callus per plate). 

25 These are referred to as round 1 selection plates. Callus 
from the fourth plate treated with pLUC-1 was transferred 
to F medium without hygromycin. This tissue was subcul- 
tured every 2-3 weeks onto nonselective medium and is 
referred to as unselected control callus. 

30 After two weeks of selection, tissue appeared 

essentially identical on both selective and nonselective 
media. All callus from eight plates from each of the 
pHYGIl/pBII221 and pCHNl-l/pBH221 treatments and two 
plates of the control callus on selective media were trans- 

35 f erred from round 1 selection plates to round 2 selection 
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plates that contained 60 mg/1 hygromycin. The round 2 
selection plates each contained ten 30 mg pieces of callus 
per plate , resulting in an expansion of the total number of 
plates. 

The remaining tissue on selective media, two 
plates each of pHYGIl/pBII221 and pCHNl-l/pBH221 treated 
tissue and one of control callus, were placed in GUS assay 
buffer at 37 °C to determine whether blue clusters of cells 
were observable at two weeks post-bombardment. After 6 
days in assay buffer, this tissue was scored for GUS 
expression. The results are summarized on Table 2. 

Table 2. 



Treatment Replicate Observations 

pLUC-1 No blue cells 

pHYGIl/pBII221 Plate 1 11 single cells 

1 four-cell cluster 

Plate 2 5 single cells 

pCHNl-l/pBH221 Plate 1 1 single cell 

2 two-cell clusters 

Plate 2 5 single cells 

1 two-cell cluster 

2 clusters of 8-10 

cells 



After 21 days on the round 2 selection plates, all 
viable portions of the material were transferred to round 3 
selection plates containing 60 mg/1 hygromycin. The round 
2 selection plates , containing only tissue that was appar- 
ently dead, were reserved. Both round 2 and 3 selection 
plates were observed periodically for viable proliferating 
sectors . 

After 35 days on round 3 selection plates, both 
the round 2 and round 3 sets of selection plates were 
checked for viable sectors of callus. Two such sectors 
were observed proliferating from a background of dead 
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tissue on plates treated with pHYGIl/pBII221. The first 
sector named 3AA was from the round 3 group of plates and 
the second sector named 6L was from the round 2 group of 
plates. Both lines were then transferred to F medium 
5 without hygromycin. 

After 19 days on F medium without hygromycin, the 
line 3AA grew very little whereas the line 6L grew rapidly. 
Both were transferred again to F medium for 9 days. The 
lines 3AA and 6L were then transferred to F medium contain- 

10 ing 15 mg/1 hygromycin for 14 days. At this point, line 
3AA was observed to be of very poor quality and slow grow- 
ing. The line 6L, however, grew rapidly on F medium with 
15 mg/1 hygromycin; the line was then subcultured to F 
medium without hygromycin. 

15 After 10 days on F medium, an inhibition study of 

the line 6L was initiated. Callus of 6L was transferred 
onto F medium containing 1, 10, 30, 100, and 250 mg/1 
hygromycin B. Five plates of callus were prepared for each 
concentration and each plate contained ten approximately 50 

20 mg pieces of callus. One plate of unselected control 

tissue was prepared for each concentration of hygromycin. 

It was found that the line 6L was capable of 
sustained growth over 9 subcultures on 0, 10, 30, 100, and 
250 mg/1 hygromycin. The name of the line 6L was changed 

25 at this time from 6L to PHI (Positive Hygromycin trans for- 
mant 1) . 

Additional sectors were recovered at various time 
points from the round 2 and 3 selection plates. None of 
these were able to grow in the presence of hygromycin for 
30 multiple rounds, i.e., two or three subcultures. 

V. Confirmation of transformed callus 

To show that the PHI callus had acquired the 
hygromycin resistance gene, a Southern blot of PHI callus 
35 was prepared as follows: DNA was isolated from PHI and 
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unselected control calli by freezing 2 g of callus in 
liquid nitrogen and grinding it to a fine powder which was 
transferred. to a 30 ml Oak Ridge tube containing 6 ml 
extraction buffer (7M urea, 250 mM NaCl, 50 mM Tris-HCl pH 
8.0, 20 mM EDTA pH 8.0, 1% sarcosine) . To this was added 7 
ml of phenol: chloroform 1:1, the tubes shaken and incubated 
at 37 °C 15 min. Samples were centrifuged at 8K for 10 min. 
at 4°C. The supernatant was pipetted through miracloth 
(Calbiochem 475855) into a disposable 15 ml tube (American 
Scientific Products, C3920-15A) containing 1 ml 4.4 M 
ammonium acetate, pH 5.2. Isopropanol, 6 ml was added, the 
tubes shaken, and the samples incubated at -20 "c for 15 
min. The DNA was pelleted in a Backman TJ-6 centrifuge at 
the maximum speed for 5 min. at 4°C. The supernatant was 
discarded and the pellet was dissolved in 500 /il TE-10 (10 
mM Tris-HCl pH 8.0, 10 mM EDTA pH 8.0) 15 min. at room 
temperature. The samples were transferred to a 1.5 ml 
Eppendorf tube and 100 pi 4.4 M ammonium acetate, pH 5.2 
and 700 yl isopropanol were added. This was incubated at 
-20*C for 15 min. and the DNA pelleted 5 min. in an Eppen- 
dorf microcentrifuge (12,000 rpm) . The pellet was washed 
with 70% ethanol, dried, and resuspended in TE-1 (10 mM 
Tris-HCl pH 8.0, 1 mM EDTA). 

The isolated DNA (10 yg) was digested with BamHI 
(NEB) and electrophoresed in a 0.8% w/v agarose gel at 15 V 
for 16 hrs in TAE buffer (40 mM Tris-acetate, 1 mM EDTA). 
The DNA within the gel was then depurinated by soaking the 
gel twice in 0.25 HC1 for 15 min., denatured and cleaved by 
soaking the gel twice in 0.5 M NaOH/l.O M NaCl 15 min., and 
neutralized by soaking the gel twice in 0.5 M Tris pH 7.4/3 
M NaCl 30 min. DNA was then blotted onto a Nytran membrane 
(Shleicher & Shuell) by capillary transfer overnight in 6X 
SSC (20X SSC, 3 M NaCl, 0.3 M sodium citrate pH 7.0). The 
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membrane was baked at 80 °C for 2 hrs under vacuum. Prehy- 
bridization treatment of the membrane was done in 6X SSC, 
10X Denhardt's solution, 1% SDS, 50 jig/ml denatured salmon 
sperm DNA using 0.25 ml prehybridization solution per cm 2 

e 

5 of membrane. Prehybridization was carried out at 42 C 
overnight . 

A 32 P labelled probe was prepared by random primer 
labelling with an Oligo Labelling Kit (Pharmacia) as per 
the supplier's instructions with 32 P-dCTP (ICN Radiochemi- 

10 cals). The template DNA used was the 1055 bp BamHI frag- 
ment of pHYGIl, which is the HPT coding sequence. The 
fragment was gel purified and cut again with PstI (NEB) 
before labelling. 

The hybridization was performed in 50% formamide, 

15 6X SSC, 1% SDS, 50 yg/ml denatured salmon sperm DNA 

(Sigma) , 0.05% sodium pyrophosphate and all of the isopro- 
panol precipitated heat denatured probe (10 7 CPM/50 ng 
template). The hybridization was carried out at 42 °C 
overnight . 

20 The membrane was washed twice in 50 ml 6X SSC, 

0.1% SDS 5 min. at room temperature with shaking, then 
twice in 500 ml 6X SSC, 0.1% SDS 15 min. at room tempera- 
ture, then twice in 500 ml IX SSC, 1% SDS 30 min. at 42 °C, 
and finally in 500 ml 0.1X SSC, 1% SDS 60 min. at 65 °C. 

25 Membranes were exposed to Kodak X-OMAT AR film in an X- 
OMATIC cassette with intensifying screens. As shown in 
Figure 3, a band was observed for PHI callus at the ex- 
pected position of 1.05 Kb, indicating that the HPT coding 
sequence was present. No band was observed for control 

30 callus. 

To demonstrate that the hygromycin gene is incor- 
porated into high molecular weight DNA, DNA isolated from 
PHI callus and control callus was treated with (i) no 
restriction enzyme, (ii) BamHI, as described previously, or 
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(iii) PstI, which cuts the plasmid pHYGIl only once within 
the HPT coding sequence. Samples were blotted and probed 
with the HPT coding sequence as described previously. 

Undigested PHI DNA only showed hybridization to 
5 the probe at the position of uncut DNA, demonstrating that 
the hygromycin gene is incorporated into high molecular 
weight DNA. The expected 1.05 Kb band for PHI DNA digested 
with BamHI was observed, as had been shown previously. For 
PHI DNA digested with PstI, a 5.9 Kb band would be expected 

10 if the hygromycin gene was present on an intact pHYGIl 

plasmid. Two or more bands of variable size (size depend- 
ent on the position flanking PstI sites within the host 
DNA) would be expected if the gene was incorporated into 
high molecular weight DNA. Three bands were observed with 

15 approximate molecular sizes of 12, 5.1, and 4.9 Kb. This 
result demonstrates incorporation of the hygromycin gene 
into high molecular weight DNA. The intensity of the 4.9 
Kb band is approximately twice as great as the other two 
bands, suggesting either partial digestion or possibly a 

20 tandem repeat of the HPT gene. No hybridization was 

observed for DNA from control callus in any of the above 
treatments. 

These results prove that the HPT coding sequence 
is not present in PHI callus as intact pHYGIl or as a small 
25 non-chromosomal plasmid. They are consistent with incor- 
poration of the hygromycin gene into high molecular weight 
DNA. 

VI. Plant regeneration and production of seed 
30 PHI callus was transferred directly from all of 

the concentrations of hygromycin used in the inhibition 
study to RM5 medium which consists of MS basal salts 
(Murashige et al. 1962) supplemented with thiamine-HCl 0.5 
mg/1, 2,4-D 0.75 mg/1, sucrose 50 g/1, asparagine 150 mg/1, 
35 and Gelrite 2.5 g/1 (Kelco Inc., San Diego). 
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After 14 days on RM5 medium, the majority of PHI 
and negative control callus was transferred to R5 medium 
which is the same as RMS medium, except that 2,4-D omitted. 

e 

These were cultured in the dark for 7 days at 26 C and 
5 transferred to a light regime of 14 hrs light and 10 hrs 
dark for 14 days at 26 °C. At this point , plantlets that 
had formed were transferred to one quart canning jars 
(Ball) containing 100 ml of R5 medium. Plants were trans- 
ferred from jars to vermiculite for 7 or 8 days before 

10 transplanting them into soil and growing them to maturity* 
A total of 65 plants were produced from PHI and a total of 
30 plants were produced from control callus. 

To demonstrate that the introduced DNA had been 
retained in the Ro tissue, a Southern blot was performed as 

15 previously described on BamHI digested leaf DNA from three 
randomly chosen Ro plants of PHI. As shown in Figure 4, a 
1.05 Kb band was observed with all three plants indicating 
that the HPT coding sequence was present. No band was 
observed for DNA from a control plant. 

20 Controlled pollinations of mature PHI plants were 

conducted by standard techniques with inbred Zea mays lines 
A188, B73, and Oh43. Seed was harvested 45 days post- 
pollination and allowed to dry further 1-2 weeks . Seed set 
varied from 0 to 40 seeds per ear when PHI was the female 

25 parent and 0 to 32 seeds per ear when PHI was the male 
• parent . 

VII. Analysis of the Rl progeny 

The presence of the hygromycin resistance trait 
30 was evaluated by a root elongation bioassay, an etiolated 
leaf bioassay, and by Southern blotting. Two ears each 
from regenerated PHI and control plants were selected for 
analysis. The pollen donor was inbred line A188 for all 
ears • 

35 
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A. Root elongation bioassay 

Seed was sterilized in a 1:1 dilution of commer- 
cial bleach in water plus alconox 0.1% for 20 min. in 125 
ml Erlenmyer flasks and rinsed 3 times in sterile water and 
5 inbibed overnight in sterile water containing 50 mg/ml 
c apt an by shaking at 150 rpm. 

After imbibition, the solution was decanted from 
the flasks and the seed transferred to flow boxes (Flow 
Laboratories) containing 3 sheets of H 2 0 saturated germina- 
10 tion paper. A fourth sheet of water saturated germination 
paper was placed on top of the seed. Seed was allowed to 
germinate 4 days. 

After the seed had germinated , approximately 1 cm 
of the primary root tip was excised from each seedling and 
15 plated on MS salts, 20 g/1 sucrose, 50 mg/1 hygromycin, 
0.25% Gelrite, and incubated in the dark at 26 °C for 4 
days . 

Roots were evaluated for the presence or absence 
of abundant root hairs and root branches. Roots were 
20 classified as transgenic (hygromycin resistant) if they had 
root hairs and root branches, and untrans formed (hygromycin 
sensitive) if they had limited numbers of branches. The 
results are shown in Table 3, hereinbelow. 

25 B. Etiolated leaf bioassay 

After the root tips were excised as described 
above, the seedlings of one PHI ear and one control ear 
were transferred to moist vermiculite and grown in the dark 
for 5 days. At this point, 1 mm sections were cut from the 

30 tip of the coleoptile, surface sterilized 10 seconds, and 
plated on MS basal salts, 20 g/1 sucrose, 2.5 g/1 Gelrite 
with either 0 (control) or 100 mg/1 hygromycin and incu- 
bated in the dark at 26 °C for 18 hrs. Each plate contained 
duplicate sections of each shoot. They were then incubated 

35 in a light regimen of 14 hrs light 10 hrs dark at 26 # C for 
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48 hrs, and rated on a scale of from 0 (all brown) to 6 
(all green) for the percent of green color in the leaf 
tissue. Shoots were classified as untransf ormed (hygro- 
mycin sensitive) if they had a rating of zero and classi- 
fied as transformed (hygromycin resistant) if they had 
a rating of 3 or greater. The results are shown in Table 
1, hereinbelow. 

C. Southern blots 

Seedlings from the bioassays were transplanted to 
soil and were grown to sexual maturity. DNA was isolated 
from 0.8 g of leaf tissue about 3 weeks after transplanting 
to soil and probed with the HPT coding sequence as des- 
cribed previously. Plants with a 1.05 Kb band present in 
the Southern blot were classified as transgenic. As shown 
in Figure 5, two out of seven progeny of PHI plant 3 were 
transgenic as were three out of eight progeny of PHI plant 
10. The blot results correlated precisely with data from 
the bioassays, confirming that the heterologous DNA was 
transmitted through one complete sexual life cycle. All 
data are summarized in Table 3 . 
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Key: + = transgenic; - = nontrans genie; ND = not done 
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Example II . 

The procedure of Example I was repeated with minor 
modifications. 

30 I. Plant Lines and Tissue Cultures 

The embryogenic maize callus line, AB12, was used 
as in Example I. The line had been initiated about 18 
months before the actual bombardment occurred. 

35 II. Plasmids 

The plasmids pBH221 and pHYGIl described in 
Example I were used. 
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III. DNA Delivery Process 

Callus was bombarded exactly as in Example I 
except that the DNA used in the tungsten/DNA preparations 
5 differed. All of the tubes contained 25 yl 50 mg/ml M-10 
tungsten in water, 25 /il 2.5 M CaCl 2/ and 10 yl 100 mM 
spermidine free base along with a total of 5 pi 1 mg/ml 
total plasmid content. One tube contained only plasmid 
pBH221; two tubes contained only plasmid pHYGIl; and one 

10 tube contained no plasmid but 5 jil TE-1 (10 mM Tris-HCl pH 
8.0, 1 mM EDTA pH 8 . 0 ) . 

The following bombardments were done: 2 x pBII221 
prep (for transient expression); 7 x pHYGIl prep (potential 
positive treatment); and 3 x TE prep (negative control 

15 treatment) . 

After all the bombardments were performed, the 
callus from the pBII221 treatment was transferred plate for 
plate to F medium as five 50 mg pieces. After 2 days, the 
callus was placed into GUS assay buffer as per Example I. 

20 Numbers of transiently expressing cells were counted and 

found to be 686 and 845 GUS positive cells, suggesting that 
the particle delivery process had occurred in the other 
bombarded plates. 

25 IV. Selection of Transformed Callus 

After bombardment, the callus from the pHYGIl 
treatments was placed onto round 1 selection plates, F 
medium containing 15 mg/1 hygromycin, as ten 25 mg pieces 
per plate (different from Example I). The same was done 

30 for two of the plates bombarded with the TE preparation 
( selected control callus ) . One plate of callus bombarded 
with the TE preparation was placed onto F medium with no 
hygromycin; this callus was maintained throughout the 
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ongoing experiment as a source of control tissue (unsel- 
ected control callus). 

After 13 days, the callus on round 1 selection 
plates was indistinguishable from unselected control 
5 callus. All of the callus was transferred from round 1 
selection plates to round 2 selection plates containing 60 
mg/1 hygromycin. An approximate five-fold expansion of the 
numbers of plates occurred. 

The callus on round 2 selection plates had 

10 increased substantially in weight after 23 days, but at 
this time appeared close to dead. All of the callus was 
transferred from round 2 selection plates to round 3 selec- 
tion plates containing 60 mg/1 hygromycin. This transfer 
of all material from round 2 to round 3 selection differs 

15 from Example I in which only viable sectors were trans- 
ferred and the round 2 plates reserved. 

At 58 days post-bombardment, three live sectors 
were observed proliferating from the surrounding dead 
tissue. All three lines were from pHYGIl treatments and 

20 were designated 24C, 56A, and 55A. 

After 15 days on maintenance medium, growth of the 
lines was observed. The line 24C grew well whereas lines 
55A and 56A grew more slowly. All three lines were trans- 
ferred to F medium containing 60 mg/1 hygromycin. Unsel- 

25 ected control callus from maintenance medium was plated to 
F medium having 60 mg/1 hygromycin. 

After 19 days on 60 mg/1 hygromycin, the growth of 
line 24C appeared to be entirely uninhibited, with the 
control showing approximately 80% of the weight gain of 

30 24C. The line 56A was completely dead, and the line 55A 
was very close to dead. The lines 24C and 55A were trans- 
ferred again to F 60 mg/1 hygromycin as was the control 
tissue. 

After 23 days on 60 mg/1 hygromycin, the line 24C 
35 again appeared entirely uninhibited. The line 55A was 
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completely dead, as was the negative control callus on its 
second exposure to F medium having 60 mg/1 hygromycin. 

At 88 days post-bombardment, a sector was observed 
proliferating among the surrounding dead tissue on the 
5 round 3 selection plates. The callus was from a plate 
bombarded with pHYGIl and was designated 13E. The callus 
was transferred to F medium and cultured for 19 days. Por- 
tions of the callus were then transferred to (i) F media 
containing 15 mg/1 hygromycin, and (ii) F medium containing 

10 60 mg/1 hygromycin. Control callus was plated on F media 
with 15 mg/1 hygromycin. After 14 days of culture, the 
callus line 13E appeared uninhibited on both levels of 
hygromycin. The control callus appeared to have about 80% 
of the weight gain of 13E. The callus lines were trans- 

15 ferred to fresh media at the same respective levels of 
hygromycin. 

V. Confirmation of Transformed Callus 

A Southern blot was prepared from BamHI-digested 

20 DNA from the line 24C. As shown in Figure 6, a band was 
observed for the line 24C at the expected size of 1.05 Kb 
showing that the line 24C contained the HPT coding 
sequence. No band was observed for DNA from control 
tissue. The name of the callus line 24C was changed to 

25 PH2. 

To demonstrate that the hygromycin gene is incor- 
porated into high molecular weight DNA, DNA isolated from 
PH2 callus and control callus was treated with (i) no 
restriction enzyme, (ii) BamHI, as described previously, 
30 or, (iii) PstI, which cuts the plasmid pHYGIl only once 
within the HPT coding sequence. Samples were blotted and 
probed with the HPT coding sequence as described previ- 
ously. 

Undigested PH2 DNA only showed hybridization to 
35 the probe at the position of uncut DNA, demonstrating that 
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the hygromycin gene is incorporated into high molecular 
weight DNA. The expected 1.05 Kb band for PH2 DNA digested 
with BamHI was observed, as had been shown previously. For 
PH2 DNA digested with PstI, a 5.9 Kb band would be expected 
if the hygromycin gene was present on an intact pHYGIl 
plasmid. Two or more bands of variable size (size depend- 
ent on the position of flanking PstI sites within the host 
DNA) would be expected if the gene was incorporated into 
high molecular weight DNA. Two bands were observed with 
approximate molecular sizes of 6.0 and 3.0 Kb. This result 
is consistent with incorporation of the hygromycin gene 
into high molecular weight DNA. No hybridization was 
observed for DNA from control callus in any of the above 
treatments . 

These results prove that the HPT coding sequence 
is not present in PH2 callus as intact pHYGIl or as a small 
non-chromosomal plasmid. They are consistent with incor- 
poration of the hygromycin gene into high molecular weight 
DNA. 

VI. Plant Regeneration and Production of Seed 

The line PH2, along with unselected control 
callus, were placed onto RM5 medium to regenerate plants as 
in Example I. After 16 days, the callus was transferred to 
R5 medium as in Example I. After 25 d on R5 medium, plant- 
lets were transferred to R5 medium and grown up for 20 
days. At this point, plant lets were transferred to ver- 
miculite for one week and then transplanted into soil where 
they are being grown to sexual maturity. 

Example III. 

The procedure of Example II was repeated exactly 
except that different plasmids were used. 

The plasmids pBH221 and pHYGIl described in 
Example I were used as well as pMS533 which is a plasmid 
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that contains the insecticidal Bacillus thurinqiensis 
endotoxin (BT) gene fused in frame with the neomycin phos- 
photransferase (NPTII) gene. At a position 5' from the 
fusion gene are located segments of DNA from the CaMV and 
5 nopaline synthase promoters. At a position 3' from the 
fusion gene are segments of DNA derived from the tomato 
protease inhibitor I gene and the poly A region of the 
nopaline synthase gene. 

Callus was bombarded exactly as in Example I 

10 except that the DNA used in the tungsten/DNA preparations 
differed. Two tubes contained plasmids pHYGIl and pMS533 
and one tube contained plasmids pHYGIl and pMS533 and one 
tube contained no plasmid but contained 5 pi TE-1 (10 mM 
Tris-HCl pH 8.0, 1 mM EDTA pH 8.0). 

15 The following bombardments were done: 9 x 

pHYGIl/pMS533 (potential positive treatment) and 2 x TE 
prep (control treatment) . 

After bombardment, the callus from the pHYGIl/- 
pMS533 treatments was placed onto round 1 selection plates, 

20 F medium containing 15 mg/1 hygromycin, as ten 25 mg pieces 
per plate. The same was done for one of the plates bom- 
barded with the TE preparation (selected control callus). 
One plate of callus bombarded with the TE preparation was 
placed onto F medium with no hygromycin; this callus was 

25 maintained throughout the ongoing experiment as a source of 
control tissue (unselected control callus). 

After 12 days, the callus on round 1 selection 
plates appeared to show about 90% of the weight gain of the 
unselected control callus All of the callus was trans- 

30 ferred from round 1 selection plates to round 2 selection 
plates containing 60 mg/1 hygromycin as ten 30 mg pieces 
per plate. After 22 days of selection on round 2 selection 
plates, the callus appeared completely uninhibited. All of 
the callus was transferred from round 2 selection plates to 

35 round 3 selection plates containing 60 mg/1 hygromycin. 
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At 74 days post-bombardment/ a single viable 
sector was observed proliferating from the surrounding 
necrotic tissue. The callus line was from pHYGIl/pMS533 
treated material and was designated 86R. The callus line 
5 86R was transferred to F medium. 

After 24 days, the callus line 86R had grown 
substantially. Portions of the callus were then trans- 
ferred to (i) F media containing 15 mg/1 hygromycin, and 
(ii) F media containing 60 mg/1 hygromycin. Control callus 

10 was plated on F media with 15 mg/1 hygromycin. 

After 19 days of culture, the callus line 86R 
appeared to grow rapidly and was uninhibited on both levels 
of hygromycin. The control callus appeared to have only 
about 50% of the weight gain of 86R. The callus lines were 

15 transferred to fresh media at the same respective levels of 
hygromycin to further test the resistance of the callus 
line 86R. After 26 days of culture, the callus line 86R 
appeared uninhibited on 60 mg/1 hygromycin. 

Southern blots were performed on DNA isolated from 

20 the callus line 86R and control callus to confirm the 

presence of the hygromycin resistance gene and to determine 
whether the BT gene was present. 

For detection of the HPT coding sequence, DNA 
isolated from 86R callus and control callus was digested 

25 with the restriction enzymes BamHI, Xhol, or PstI as des- 
cribed in Examples I and II. After hybridization with a 
probe prepared from the HPT coding sequence, the following 
bands were observed. For the BamHI digest, bands were 
observed at the expected size of 1.05 Kb as well as at 

30 approximately 3.0 and 2.3 Kb. This result demonstrates 

that the HPT coding sequence is present in the callus line 
86R. The additional bands at 3.0 and 2.3 Kb indicate that 
either digestion was incomplete or that multiple rearranged 
copies are present. For the Xhol digest, a single band was 

35 observed at approximately 5.1 Kb. Because Xhol does not 
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cut pHYGIl, this suggests incorporation of the hygromycin 
construct into DNA different than pHYGIl. For the PstI 
digestion, a large band was observed at approximately 5.1 
Kb. This band appeared to be two fragments of similar 
5 molecular weight. Two or more bands would be expected from 
a PstI digestion if the gene was incorporated into high 
molecular weight DNA. In no case was hybridization 
observed for DNA from control callus for any of the above- 
mentioned digestions. 

10 For detection of the BT gene, a Southern blot was 

carried out on DNA isolated from 86R and control callus 
digested with the enzymes BamHI and Xhol in combination. A 
BamHI, Xhol co-digestion liberates the 1.8 Kb BT coding 
sequence from the pMS533 construction used in this trans- 

15 formation. The blot prepared was hybridized to a probe 
prepared from the 1.8 Kb BT coding sequence. A band was 
observed for 86R DNA at the expected size of 1.8 Kb whereas 
no hybridization was observed for control DNA. Additional 
bands of much lesser intensity were also observed for 86R 

20 DNA. This result demonstrates that the BT coding sequence 
is present in the callus line 86R. This further demon- 
strates the introduction into maize of an unselected gene 
with potential commercial value. The name of callus line 
86R was changed to CB1. 

25 Plants are being regenerated from CBl callus and 

control callus as described in Example I. 

COMPARATIVE EXAMPLE A 
The basic procedures of Examples I-III have been 
30 attempted except varying the selection regime or the form 
of the callus. These other attempts , which are detailed in 
Table 4 below, were not successful. Since they were not 
repeated several times, it is not known whether they can be 
made to work. In all of the procedures, no viable sectors 
35 were observed. In the Table, "sieved" indicates that the 
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callus was passed through an 860 micron sieve before bom- 
bardment. The selection agent was hygromycin for each case 
except when pMXTIl was the plasmid and methotrexate the 
selection agent. 



Table 4 
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The invention has been described with reference to 
25 various specific and preferred embodiments and techniques. 
However , it should be understood that many variations and 
modifications may be made while remaining within the spirit 
and scope of the invention. 
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WHAT IS CLAIMED IS : 

1. A fertile transgenic Zea mays plant containing hetero- 
logous DNA which is heritable. 

2. The plant of claim 1 wherein the heterologous DNA is 
chromosomally integrated. 

3. The plant of claim 1 wherein the heterologous DNA is 
expressed. 

4. The plant of claim 1 wherein the heterologous DNA 
encodes a protein. 

5. The plant of claim 1 wherein the heterologous DNA com- 
prises a promoter. 

6. The plant of claim 1 selected from the group consisting 
essentially of field corn, popcorn, sweet corn, flint 
corn, and dent corn. 

7. The plant of claim 1 wherein said heterologous DNA 
comprises a DNA sequence selected from the group con- 
sisting of a bacterial dap A gene, a BT-endotoxin gene, 
a protease inhibitor gene, a bacterial ESPS synthase 
gene, a chitinase gene, a glucan endo-1, 3-B-glucosidase 
gene, structural DNA, regulatory sequence, an iden- 
tification sequence and a sequence encoding antisense 
RNA. 

8. The plant of claim 1 wherein said heterologous DNA 
encodes a beneficial trait to the plant. 



9. The plant of claim 8 wherein said beneficial trait is 
selected from the group consisting of promoting 
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increased food value, higher yield, reduced production 
Hcost7)pest resistance, stress tolerance, drought resis- 
tance, and disease resistance. 

The plant of claim 1 which expresses a selectable 
marker gene or a reporter gene. 

The plant of claim 10 wherein the selectable marker 
gene confers resistance or tolerance to a compound 
selected from the group consisting of hygromycin, 
" kanamyc i in 7" G4 1 8 TCl^? -dichToropropio neomyc in , 

phosphonotricin, glyphosate, methotrexate^ imidazo- 
linone, chlorsulfuron, and bromoxynil. 

The plant of claim 11 wherein the selectable marker 
gene confers resistance or tolerance to hygromycin. 

The plant of claim 10 which expresses a reporter gene. 

The seed produced by the plant of claim 1 which has 
inherited the heterologous DNA. 

The grain produced from the plant of claim 1 wherein 
the heterologous DNA increases the feed value or the 
food value of said grain. 

The Rl and subsequent generations derived from the 
plant of claim 1. 

A process for producing a fertile transgenic Zea mays 
plant which stably expresses heterologous DNA which is 
heritable, wherein the process comprises the steps of 
(i) establishing a regenerable cell culture from a 
plant to be transformed, (ii) transforming said culture 
by bombarding it with DNA-coated microprojectiles, 
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(iii) identifying or selecting a transformed cell line, 
and (iv) regenerating a fertile transgenic plant there- 
from . 

18. The process of claim 17 wherein the cell culture is a 
friable, embryogenic callus culture. 

19. The process of claim 18 wherein the callus culture sub- 
jected to bombardment is in clumps of about 30 to 80 mg 
per clump. 

20. The process of claim 19 wherein said callus is initi- 
ated on solid media. 

21. A plant prepared by the process of claim 17 wherein the 
plant is regenerated from transformed callus. 

22. A transgenic Zea mays plant comprising heterologous DNA 
which is heritable, wherein said DNA was introduced 
into said plant, or an ancestor of said plant, at an 
embryogenic stage. 

23. The transgenic Zea mays plant of claim 22 wherein the 
heterologous DNA was introduced into said plant by 
micropro jectile bombardment. 

24. The transgenic Zea mays plant of claim 23 wherein the 
heterologous DNA was introduced by micropro jectile bom- 
bardment of a callus culture. 
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ABSTRACT OF THE DISCLOSURE 
Fertile transgenic Zea mays (corn) plants which 
stably express heterologous DNA which is heritable are 
provided along with a process for producing said plants. 
The preferred process comprises the microprojectile bom- 
bardment of friable embryogenic callus from the plant to be 
transformed. The process may be applicable to other grami- 
naceous cereal plants which have not proven stably trans- 
formable by other techniques. 
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PROCESS FOR TRANSFORMING GRAMINEAE 
AND THE PRODUCTS THEREOF 

This is a continuation of application Ser. No. 5 
07/067,902, filed Jun. 29, 1987, now abandoned, which 
was a continuation-in-part of application Ser. No. 
06/880,271, filed Jun. 30, 1986, now abandoned. 



BACKGROUND OF THE INVENTION 
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Virulent strains of the soil bacterium Agrobacterium 
tumefaciens are known to infect dicotyledonous plants 
and to elicit a neoplastic response in these plants. The 
tumor-inducing agent in the bacterium is a plasmid that 
functions by transferring some of its DNA into its host 15 
plant's cells where it is integrated into the chromosomes 
of the host plant's cells. This plasmid is called the Ti 
plasmid, and the virulence of the various strains of A. 
tumefaciens is determined in part by the vir region of the 
Ti plasmid which is responsible for mobilization and 20 
transfer of the T-DNA. The T-DNA section is delim- 
ited by two 23-base-pair repeats designated right border 
and left border, respectively. Any genetic information 
placed between these two border sequences may be 
mobilized and delivered to a susceptible host. Once 25 
incorporated into a chromosome, the T-DNA genes 
behave like normal dominant plant genes. They are 
stably maintained, expressed and sexually transmitted 
by transformed plants, and they are inherited in normal 
Mendelian fashion. . 30 

The lump of plant tumor tissue that grows in an undif- 
ferentiated way at the site of the A. tumefaciens infection 
is called a crown gall. Cells of crown gall tumors in- 
duced by A. tumefaciens synthesize unusual amino acids 
called opines. Different strains of A. tumefaciens direct 35 
the synthesis of different opines by the crown gall cells, 
and the particular opine induced is a characteristic of 
the strain which infected the plant. Further, the ability 
to catabolize the particular opine induced by a given 
strain is also characteristic of that strain. 40 

Opines are not normally synthesized by A. tumefach 
ens or by the uninfected host plants. Although it is the 
T-DNA which codes for the enzymes involved in the 
synthesis of the opines, the opine synthases, these genes 
are expressed only in infected plant tissue. This type of 45 
expression is consistent with the observation that these 
genes are under the control of eukaryotic regulatory 
sequences on the T-DNA. 

The most common opines are octopine and nopaline. 
The opine synthase that catalyzes the synthesis of octo- 50 
pine is lysopine dehydrogenase, and the opine synthase 
that catalyzes the synthesis of nopaline is nopaline dehy- 
drogenase. 

When crown gall cells are put into culture, they grow 
to form a callus culture even in media devoid of the 55 
plant hormones that must be added to induce normal 
plant cells to grow in culture. A callus culture is a disor- 
ganized mass of relatively undifferentiated plant cells. 
This ability of crown gall cells to grow in hormone-free 
media is also attributable to the presence of the T-DNA 60 
in the tran>formed host plant cells since genes which 
direct the synthesis of phyto-hormone are also associ- 
ated with the T-DNA. 

A DNA segment foreign to the A. tumefaciens and to 
the host plant which is inserted into the T-DNA by 65 
genetic manipulation will also be transferred to host 
plant's cells by A. tumefaciens. Thus, the Ti plasmid can 
be used as a vector for the genetic engineering of host 



plants. Although, in wild type A. tumefaciens there is 
only one Ti plasmid per bacterium, in genetically-engi- 
neered A. tumefaciens, the vir region and the T-DNA do 
not have to be carried on the same Ti plasmid for trans- 
fer of the T-DNA to occur. The vir region and the 
T-DNA can be carried on separate plasmids contained 
within the same Agrobacterium. 
. It has generally been assumed that the host range of 
A. tumefaciens was limited to the dicotyledons, and that 
transformation of monocotyledons could not be ef- 
fected with this bacterium. Indeed, no one has reported 
the transformation of any member of the monocotyle- 
donous Gramineae family by infection with A, tumefaci- 



However, recently, Hooykaas-Van Slogteren et al., 
in Nature. 311, 763 (1984), reported the production of 
small swellings at wound sites infected with A. tumefaci- 
ens on monocotyledons of the Liliaceae and Amaryl- 
lidaceae families. Opines were detected in plant cells 
taken from the wound sites of the infected plants. ■ , 

Also, Hernalsteens et al. reported in The EMBO Jour- 
nal, 3,. 3039 (1984) that cultured stem fragments of the 
monocotyledon Asparagus officinalis, a member of the 
family Liliaceae, infected with A. tumefaciens strain C58 
developed tumorous proliferations. One of these tumor- 
ous proliferations could be propagated on hormone-free 
medium, and opines were detected in the established 
callus culture derived from this tumorous proliferation. 

In 1982, Anne C. F. Graves reported in her Ph.D. 
Dissertation entitled "Some Tumorigenic Activities of 
Agrobacterium Tumefaciens (Smith and Town) Conn." 
(Bowling Green State University) that irregular masses 
of tissue developed on gladiolus discs in response to 
inoculations with A. tumefaciens C58N and B6. These 
masses of tissue appeared to be the same as, and have 
cellular morphology similar to, those that develop on 
potato tuber discs. Gladiolus is a member of the mono- 
cotyledonous lridaceae family. A compound that co- 
migrated with the octopine standard during electropho- 
resis was found in the proliferations on the gladiolus 
discs that were induced by strain B6, and one that mi- 
grated just behind the octopine standard occurred in 
those induced by C58N. Also, octopine dehydrogenase 
was found in extracts of the cellular proliferations in- 
duced by A. tumefaciens B6 but not in those induced by 
A. tumefaciens C58N. 

Dr. Graves also described the response of certain 
other monocots to inoculation with A, tumefaciens. No 
cellular proliferation was observed on ginger root rhi- 
zome discs, and the results with tulip bulb discs were 
inconclusive. Cellular proliferations on discs of the 
rhizomes of cattail and skunk cabbage were limited to 
several layers of clear cells at the ends of vascular bun- 
dles in the early spring. 

DeCleene and DeLey in The Botanical Review, 42, 
389 (1976) reported the results of an extensive study of 
the plant host range of A. tumefaciens. Their article 
teaches that monocots of the orders Liliales and Arales 
are susceptible to infection with A. tumefaciens but that 
monocotyledons in general are unsusceptible to A. tu- 
mefaciens infection. In particular, their article reports 
that the Gramineae tested were not susceptible to infec- 
tion with/4, tumefaciens. Susceptibility to A. tumefaciens 
infection was determined by whether a swelling or 
tumor developed at the wound site. 

Lorz et al. in <Mol Gen. Genet., 199, 178 (1985), 
Fromm et al in Nature, 319, 791 (1986) and Portrykus et 
al. in Mol Gen. Genet, 199, 183 (1985) have reported the 
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transformation of Gramineae by direct gene transfer to 
protoplasts. Protoplasts are plant cells from which the 
cell wall has been removed by digestion with enzymes. 
Lorz et al. transformed protoplasts of Triticum mono- 
coccum using a DNA construct containing the nopaline 5 
synthase promotor and the polyadenylation regulatory 
signal of the octopine synthase gene. Fromm et al dis- 
closes that the electroporation-mediated transfer of 
plasmid pCaMVNEO (comprising the cauliflower mo- 
saic virus 35 S promoter, the neomycin phosphotrans- 10 
ferase II gene from the transposon Tn5 and the nopaline 
synthase 3' region) into maize protoplasts results in 
stably-transformed maize cells that are resistant to kana- 
mycin. 

To obtain transformed plants from the transformed 15 
cells generated using either the infection techniques of 
Hooykaas-Van Slogteren et al., Hernalsteens et al., 
Graves and DeCleene and DeLey or the direct gene 
transfer techniques of Lorz et al., Fromm et al. and 
Portrykus et al., the plants would have to be regener- 20 
ated from protoplasts or single cell cultures. However, 
no one has yet been able to regenerate plants from pro- 
toplasts or single cell cultures of the Gramineae. In- 
deed, no means of producing transformed plants or 
other transformed differentiated organs or tissues of the 25 
Gramineae is currently known, and no method yet ex- 
ists for transforming the Gramineae in a manner which 
allows for the expression of exogenous DNA in agricul- 
turally important forms or parts of the Gramineae such 
as seeds, pollen, ears or plants. 30 

Finally, PCT International Publication No. WO 
86/00931 (Simpson et al.) published Feb. 13, 1986, 
teaches in vivo methods for transforming and regener- 
ating intact plants. This patent application discloses that 
the methods of the invention. can be used for the trans- 35 
formation of any plant that forms a shooty tumor fol- 
lowing infection with an A. tumefaciens shooty mutant 
strain. However, as noted above, the Gramineae are not 
known to produce tumors or even swellings in response 
to inoculation with A. tumefaciens. In the practice of the 40 
present invention, no tumors, swellings or cellular pro- 
liferations of any kind have been observed on the 
Gramineae in response to inoculation with A. tumefaci- 
ens. 



SUMMARY OF THE INVENTION 
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According to the present invention, there is now 
provided a method of producing transformed Grami- 
neae comprising making a wound in a seedling in an 
area of the seedling containing rapidly dividing cells 50 
and inoculating the wound with vir+ A. tumefaciens. In 
the preferred practice of the invention, the wound is 
made in an area of the seedling which gives rise to the 
germ line cells. Also, preferred is the use of vir+ A. 
tumefaciens which contains a vector comprising geneti- 55 
cally-engineered T-DNA. . 

According to another aspect of the invention, there 
are provided: (1) transformed pollen grains; (2) a trans- 
formed pollen grain produced by a plant grown from a 
seedling infected with vir+ A. tumefaciens: (3) a trans- 60 
formed pollen graju produced by a plant grown from a 
seedling infected with vir+ A. tumefaciens which con- 
tains a vector comprising genetically-engineered T- 
DNA; (4) a pollen grain whose cells contain at least a 
segment of T-DNA; and (5) Gramineae derived from 65 
each of these four pollen grains. In addition, there are 
provided: (1) a transformed Gramineae plant; (2) a 
transformed Gramineae plant derived from a seedling 



infected with vir + A. tumefaciens; (3) a transformed 
Gramineae plant derived from a seedling infected with 
vir + A. tumefaciens which contains a vector comprising 
genetically-engineered T-DNA; and (4) a Gramineae 
plant whose cells contain a segment of T-DNA. Finally, 
there are provided transformed Gramineae derived 
from seedlings infected with vir + A. tumefaciens and 
transformed Gramineae derived from seedlings infected 
with vir + A. tumefaciens which contains a vector com- 
prising genetically-engineered T-DNA. 

The invention is clearly useful since it provides, for 
the first time, a method for transforming Gramineae 
which results in the production of transformed differen- 
tiated organs and tissue such as leaves, plants and pol- 
len. Thus, the invention provides, for the first time, a 
method of transforming Gramineae which allows for 
the expression of exogenous DNA in agriculturally 
important forms or parts of the Gramineae. Many of the 
Gramineae (such as corn, oats, rye, barley, sorghum, 
rice and wheat) are, of course, commercially important 
food sources for humans and other animals, and the 
invention allows for the development of strains of 
Gramineae having altered or superior traits, such as 
higher yielding strains and strains having resistance to 
herbicides or better nutritional value, by providing a 
means whereby exogenous DNA coding for such traits 
can be incorporated into the Gramineae. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGS. 1-18 show a corn seedling which is ninety-six 
hours old. FIG. 1A is the side view of the seedling, and 
FIG. IB is the front view of the same seedling. 

FIG. 2A is a drawing of developed paper electropho- 
retogram which shows t)ie results of the electrophoresis 
of the products produced by incubating cell-free ex- 
tracts of yellow Iochief corn seedlings inoculated with 
A. tumefaciens strain B6 with a reaction medium con- 
taining reagents which allow for the detection of lyso- 
pine dehydrogenase enzyme activity (lysopine dehy- 
drogenase reaction medium). Certain controls were also 
electrophoresed. 

FIG. 2B is a drawing of a developed paper electro- 
phoretogram which shows the results of the electropho- 
resis of the products produced by incubating cell-free 
extracts of yellow Iochief corn seedlings with a reaction 
medium containing reagents which allow for the detec- 
tion of nopaline dehydrogenase enzyme activity (nopa- 
line dehydrogenase reaction medium). Certain controls 
were also electrophoresed. 

FIG. 2C is a drawing of a developed paper electro- 
phoretogram which shows the presence or absence of 
pyronopaline in various materials. 

FIG. 2D is a drawing of a developed paper electro- 
phoretogram. The materials electrophoresed were pro- 
duced by the catabolism by certain strains of A. tumefa- 
ciens of the products produced by incubating cell-free 
extracts of yellow Iochief corn seedlings inoculated 
with A. tumefaciens strain C58 with nopaline dehydro- 
genase reaction medium. Certain controls were also 
electrophoresed. 

FIG. 3A is a drawing of a developed paper elect ro- 
phoretogram which shows the results of the electropho- 
resis of the products produced by incubating cell-free 
extracts of single B6-inoculated yellow Iochief corn 
seedlings with lysopine dehydrogenase reaction me- 
dium. 

FIG. 3B is a drawing of a developed paper electro- 
phoretogram which shows the results of the electropho- 
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resis of the products produced by incubating cell-free 
extracts of single yellow lochief corn seedlings inocu- 
lated with either A. tumefaciens strain A348 or strain 
238MX with lysopine dehydrogenase reaction medium. 

FIG. 3C is a drawing of a developed paper electro- 
phoretogram showing the results of the electrophoresis 
of the products produced by incubating cell-free ex- 
tracts of single C58-inoculated yellow lochief corn 
seedlings with nopaline dehydrogenase reaction me- 
dium. j:J 

FIG. 3D is a drawing of a developed paper electro- 
phoretogram showing the results of the electrophoresis 
of the products produced by incubating cell-free ex- 
tracts of single yellow lochief corn seedlings inoculated 
with A. tumefaciens strain JK 195 with nopaline dehy- 
drogenase reaction medium. < 

FIG. 4A is a drawing of a developed paper electro- 
phoretogram which is the result of the electrophoresis 
of the cell-free sonicates of A. tumefaciens strains B6 and 
C58. 

FIG. 4B is a drawing of a developed paper electro- 
phoretogram showing the results of the electrophoresis 
of the products produced by incubating the cell-free 
sonicates of A. tumefaciens strains B6 and C58 for four 
hours with an appropriate reaction medium. Also elec- 
trophoresed were the reaction media alone. 

FIG. 4C is a drawing of a developed paper electro- 
phoretogram which shows the results of the electropho- 
resis of the product produced by incubating the cell-free 
extracts of single uninfected seedlings of the yellow 
lochief strain of corn for four hours with lysopine dehy- 
drogenase reaction' medium or with nopaline dehydro- 
genase reaction mediun). 

FIG. 5 A is a drawing of a developed electrophoreto- 
gram which is the result of the electrophoresis of the 
products produced by incubating the cell-free extracts 
of single embryonic -leaves from plants grown from 
C58-inoculated yellow lochief corn seedlings with 
nopaline dehydrogenase reaction medium. 

FIG. 5B is a drawing of a corn leaf of meristematic 
origin showing the areas dissected for assay for nopaline 
dehydrogenase activity. 

FIG. 5C is a drawing of a developed electrophoreto- 
gram showing the results of the electrophoresis of the 
products produced by incubating the cell-free extracts 45 
of the dissected sections of four meristematic leaves 
from four separate plants grown from C58-inoculated 
yellow lochief corn seedlings with nopaline dehydro- 
genase reaction medium. 

FIG. 5D is a drawing of a developed electrophoreto- 
gram showing the results of the electrophoresis of the 
products produced by incubating cell-free extracts of 
pollen from individual plants grown from C58- 
inoculated yellow lochief corn seedlings with nopaline 
dehydrogenase reaction medium. 

FIG. 6 is a drawing of a developed electrophoreto- 
gram which is the result of the electrophoresis of the 
products produced by incubating the cell-free extracts 
of single seedlings of yellow lochief corn inoculated 
with A. tumefaciens strain CA19 with lysopine dehydro- 
genase reaction medium. 

FIG. 7 is a drawing of a developed electrophoreto- 
gram which shows the results of the electrophoresis of 
the products produced by incubating the cell-free ex- 
tracts of single C58-inoculated seedlings of strain PA91 
corn with nopaline dehydrogenase reaction medium. 

FIG. 8 is a restriction site and function map of plas- 
mid pCEL30. 
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FIG. 9 is a restriction site and function map of plas- 
mid pCEL44. 

FIG. 10 is a drawing of developed paper electropho- 
retogram which shows the results of the electrophoresis 
of the products produced by incubating cell-free ex- 
tracts of single B6-inoculated rye seedlings with lyso- 
pine dehydrogenase reaction medium. 

FIG. 11 is a drawing of a developed paper electro- 
phoretogram which shows the results of the electropho- 
resis of the products produced by incubating cell-free 
extracts of single B6-inoculated barley seedlings with 
lysopine dehydrogenase reaction medium. 

FIG. 12 is a drawing of a developed paper electro- 
phoretogram showing the results of the electrophoresis 
of the products produced by incubating cell-free ex- 
tracts of single C58-inoculated oat seedlings with nopa- 
line dehydrogenase reaction medium. 

FIG. 13 is a drawing of a developed electrophoreto- 
gram which is the result of the electrophoresis of the 
products produced by incubating the cell-free extracts 
of single C58-inoculated wheat seedlings with nopaline 
dehydrogenase reaction medium and with lysopine 
dehydrogenase reaction medium. 

FIG. 14 is a drawing of a developed electrophoreio- 
gram showing assays for enzymatic activity in the upper 
leaves of transformed plants. 

In all of the drawings where they are used, the desig- 
nations "O" and "OCT" mean octopine, and the desig- 
nations "N" or "NOP" mean nopaline. These designa- 
tions are used on the drawings to refer to the lane of the 
electrophoretogram containing the synthetic octopine 
or nopaline standard or to show the location of spot 
formed by the synthetic octopine or nopaline standard 
after electrophoresis. 

In some of the drawings there are spots which do not 
co-migrate with either the synthetic octopine or nopa- 
line standards. These spots are formed by unreacted 
reactants in reaction media or are formed by naturally- 
occurring materials found in the cell-free extracts of the 
corn seedlings.- 

DETAILED DESCRIPTION OF THE 
PRESENTLY PREFERRED EMBODIMENTS 

According to the invention, there is provided a 
method of producing -transformed Gramineae compris- 
ing making a wound in a seedling in an area of the s< d- 
ling containing rapidly dividing cells and inoculating 
the wound with vir+ A. tumefaciens. The term Grami- 
neae, as used herein, is meant to include all of the forms 
and parts of the Gramineae, including plants, seeds, 
seedlings, pollen, kernels, ears, leaves, stalks and em- 
bryos. Similarly, the terms "corn", "oats", "wheat," 
"rye" and "barley" are meant to include all forms and 
parts of the corn, oats, wheat, rye or barley. "Trans- 
formed" is used herein to mean genetically modified by 
the incorporation of exogenous DNA into cells. "Exog- 
enous DNA" is DNA not normally found in the strain 
of Gramineae which is to be transformed. Exogenous 
DNA may be obtained from prokaryotic or eukaryotic 
sources, including strains of Gramineae other than the 
one to be transformed. 

In practicing the method of the invention, the strain 
of Gramineae to be transformed is sterilized and is then 
germinated until the radicle (primary root) and stem 
emerge from the seed. This occurs after about four days 
of germination, but this time may be shortened by first 
soaking the seeds. 
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The wound is preferably made in an area of rapidly 
dividing cells which gives rise to germ line cells. After 
making the wound, the seedling is inoculated by drip- 
ping a solution of vir+ A. wmefaciens into the wound. 
Vir+ A. tumefaciens are bacteria which are capable of 
mobilizing and transferring T-DNA into host plant 
cells, and an A. tumefaciens carrying a plasmid, whether 
natural or engineered, coding for these functions is 
vir+. Thus, a strain of A. tumefaciens that carries a 
wild-type Ti plasmid is vir+ and can be used in the 
present invention. Many such strains are known and are 
publicly available. See e.g., American Type Culture 
Collection Catalogue (ATCC) of Strain I, p. 66 (15th 
edition, 1982). The vir+ region of a wild type Ti plas- 
mid can be used to mobilize and transfer T-DNA on the 
same Ti plasmid or to deliver T-DNA on another plas- 
mid contained in the same bacterium. In addition, the 
mobilization and transfer functions can be supplied by 
helper plasmids. Such helper plasmids have been de- 
scribed by Ditta et al. in PNAS, 77, 7347 (1980) and by 
Bagdasarian et al. in Gene, 16, 237 (1981). Thus, a strain 
of A. tumefaciens that carries a helper plasmid is also 
vir+. Finally, the mobilization and transfer functions 
may be coded on the same engineered plasmid which 
contains the T-DNA, and bacteria containing such a 
plasmid are also vir + . 

The T-DNA transferred by the vir+ A. tumefaciens 
may be native T-DNA or may preferably be genetical- 
ly-engineered T-DNA. Genetically-engineered T-DNA 
is a DNA construct comprising T-DNA border sequen- 
ces, a heterologous gene and a transcription unit con- 
nected in operable order. Methods of preparing such 
constructs are known in the' art. 

A heterologous gene is a gene which is not normally 
found in the T-DNA and which is also not normally 
found in the DNA of the strain of Gramineae which is 
to be transformed. Heterologous genes may be isolated 
from prokaryotic and eukaryotic sources, including 
strains of Gramineae other than the one to be trans- 
formed. Of particular interest are those heterologous 
genes which confer agronomically significant traits on 
plants containing them. 

The heterologous gene is flanked by a transcription 
unit containing, e.g., promotors and terminators, which 
allow for expression of the heterologous gene in the 
strain of Gramineae to be transformed. The heterolo- 
gous gene-transcription-unit ^construct is flanked by the 
border sequences. Any T-DNA border sequence, native 
or synthesized, can be used to flank the heterologous- 
gene-transcription-unit construct as long as the border 
sequence functions to integrate the heterologous gene 
into the cell genome of the strain of Gramineae to be 
transformed. The genetically-engineered T-DNA is 
linked to a DNA fragment containing a replicon that is 
functional in Agrobacterium to form a vector. 

After the seedlings are inoculated with the vir+ A. 
tumefaciens, they are incubated until transformation has 
taken place at which time the seedlings are planted and 
allowed to grow at least until such time as they have 
produced pollen. It is interesting to note that using the 60 
method of the invention, no tumorous growth of any 
kind, including "crown galls, calli or tumorous over- 
growths, has been observed, even on the original inocu- 
lated seedling. 

By inoculating the seedlings in the preferred area, 65 
transformation of pollen has been achieved. The result- 
ing transformed pollen can be used to fertilize trans- 
formed and untransformed plants. The embryos can be 
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excised from the resultant progeny ears and grown to 
produce transformed plants. Alternatively, of course, 
the plants resulting from this mating can be allowed to 
produce seeds which are, in turn, used to grow trans- 
5 formed whole plants which produce another crop of 
seeds. Thus, future-generations can be derived from the 
original transformed seedling by sexual reproduction. 
Those skilled in the art will recognize that various and 
numerous progeny carrying the trait coded for by the 
10 exogenous DNA can be produced using known breed- 
ing techniques, 

TRANSFORMATION OF CORN 

EXAMPLE I 

A. Preparation of Bacteria 

A single colony of the A. tumefaciens strain B6 was 
inoculated into a yeast extract broth (YEB) containing 
0.1% yeast extract, 0.8% nutrient broth and 0.5% su- 
20 crose dissolved in water. The yeast extract and nutrient 
broth were purchased from Difco Laboratories, De- 
troit, Mich. The sucrose was purchased from either 
Fisher Scientific, Detroit, Mich., or Sigma, St. Louis, 
Mo. The bacteria were incubated in the YEB for 48 
25 hours at 27° C. in a shaking water bath or until such time 
as they had reached a final concentration of 3.8X10 9 
cells per milliliter. 

The B6 strain is a standard wild type strain of A. 
tumefaciens. It is virulent (vir+), and it codes for the 
30 production of lysopine dehydrogenase in suitable plant 
hosts. It was obtained from James and Barbara Lippin- 
cott, Northwestern University, Evanston, 111. Some of 
its properties have been described in Stonier, J. Bact., 
79, 889 (1960). The B6 strain is also on deposit at the 
35 American Type Culture Collection (ATCC) ; Rockville, 
Md., and it has been given accession number 23308. 

B. Preparation of Corn 

Seeds of the inbred yellow Iochief strain of corn were 
40 obtained from The Andersons, Maumee, Ohio, or from 
Botzum, 43 East Market, Akron, Ohio. This strain is a 
standard strain which is available commercially. 

The yellow Iochief seeds were sterilized using the 
following procedure.' The seeds were first immersed for 
45 two minutes in a solution containing 7 parts 95% etha- 
nol and 2.5 parts distilled water. Next, the seeds were 
incubated for five minutes in a solution of 0.5% (weight- 
/volume) HgCh in distilled water. Next, the seeds were 
washed for a total of thirty minutes in a solution of 15% 
50 (volume/volume) Clorox (Clorox is an aqueous solution 
containing 5.25% sodium hypochlorite) and 0.1% 
(volume/volume) of a liquid dishwashing detergent 
such as Palmolive or another suitable wetting agent in 
distilled water. Finally, the seeds were washed five 
55 times in sterile double distilled water. 

The sterilized seeds were placed embryo side up on 
sterile moistened Whatman No. 3 filter paper contained 
in a sterile Petri dish. The seeds were incubated in the 
covered Petri dishes in constant darkness at 25° C. for 
four days. The filter paper was kept moist throughout 
the incubation period. 

C. Inoculation of Seedlings with A. tumefaciens 

The two primary areas of rapidly dividing cells in the 
corn seedling are the rpol cap and the area extending 
from the base of the scutellar node through the mesoco- 
tyl slightly beyond the coleoptile node, the locations of 
which are depicted in FIGS. 1A and IB. The mesocotyl 
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is the area between the scutellar node and the coleoptile 
node. 

In the preferred practice of the invention, the wound 
is made in the area extending from the base of the scutel- 
lar node through and slightly beyond the coleoptile 
node because, included within the region, are tissues 
which give rise to germ line cells. In particular, found in 
this region is tissue which gives rise to the axillary pri- 
mordia which, in turn, gives rise to tillers and to the ears 
(female reproductive organs). Also, located in this re- 
gion is the apical meristem which gives rise to the tassle 
which, in turn, gives rise to the pollen (the male repro- 
ductive organs). By inoculating the seedling in this 
preferred area, transformation of germ line cells has 
been obtained. 

Accordingly, a total of four wounds were made on 
the surface of each of the germinating corn seedlings 
prepared in part C of this example in an area which 
extends from the base of the scutellar node through and 
slightly beyond the coleoptile node. The wounds were 
made by visualizing a line which bisects this area longi- 
tudinally (the midline) as the seedling is viewed from 
the front as in FIG. IB. The cuts were made perpendic- 
ular to the midline, from the midline to the outside edge 
of the seedling and from the front surface of the seed- 
ling, when the seedling is viewed as in FIG. IB, through 
all of the tissue in the area where the cut is made. Thus, 
when a cut is made in the area of the scutellar node, the 
cut is made from the front surface through all of the 
tissue into the scutellum; and, when a cut is made in the 
mesocotyl, the cut is made from the front surface com- 
pletely through all of the mesocotyl tissue. The four 
wounds are indicated by the numeral 1 in FIG. IB. 

The wounds were inoculated by dripping a total of 
100 ul of a 10 9 cells/milliliter suspension of A. tumefaci- 
ens strain B6 in YEB, cultured as described above in 
part A, onto the four wounds. As a control, some seed- 
lings were inoculated with 0.9% NaCl (saline). After 
receiving the inoculum, the seedlings were placed em- 
bryo side up on a layer of Bactoagar contained in a Petri 40 
dish, at 5 seedlings per dish. The Petri dish contained 20 
milliliters of sterile Bactoagar (purchased from Difco 
Laboratories) at a concentration of 20 grams/liter in 
distilled water. The covered Petri dishes were incu- 
bated at IT C. in constant darkness for an additional 45 
7-14 days. 

D. Assays 

1. Assay For Enzyme Activity in Seedlings: At the 
end of the 7-14 day incubation period, the seedlings 50 
were homogenized in a 0.1 M Tris-HCl buffer, 8.0, con- 
taining 0.5M sucrose, 0.1% l (weight/volume) ascorbic 
acid and 0.1% (weight/volume) cysteine-HCl, using a 
Wheaton tissue grinder, until the homogenate had a 
homogeneous consistency. Since the seedlings were 55 
grown in the dark, pigment formation was retarded, and 
cell walls remained unusually soft. Thus, the cells of the 
seedlings broke open easily. Next, the homogenates 
were spun in a Fisher Microfuge at 13,000 xg for two 
minutes to obtain cell-free extracts. 60 

A portion of the cell-free extract was added to an 
equal volume of a reaction medium designed to detect 
lysopine dehydrogenase activity. This reaction medium 
consisted of 30 mM L-arginine, 75 mM pyruvate and 20 
mM NADH dissolved in 0.2M sodium phosphate 65 
buffer, pH 7.0. The enzyme reaction was allowed to 
proceed at room temperature for the times indicated 
below. 



10 



15 



20 



25 



30 



35 



The products of the enzyme reaction were separated 
electrophoretically on Whatman 3MM paper. At the 
start of the enzyme reaction period (time zero), a 5 ul 
sample of the reaction mixture was spotted at the anodal 
site on the paper and dried. After 15 hours of reaction, 
another 5 ul sample of the reaction mixture was spotted 
on the paper and dried. Finally, a 5 ul sample of a 100 
ug/ml solution of synthetic octopine purchased from 
Calbiochem, Division of American Hoescht, La Jolla, 
Calif., and a 5 ul sample of 100 ug/ml solution of syn- 
thetic nopaline purchased from Sigma, St. Louis, Mo., 
were spotted on the paper and dried. 

Electrophoresis was performed in a formic acid 
(90.8%)/glacial acetic acid/water (5:15:80; volume/- 
volume) solution, pH 1.8, for 2.5 hours at 450 volts. The 
paper was dried and then stained by dipping it into a 
solution containing one part 0.02% (weight/volume) 
phenanthrenequinone in absolute ethanol plus one part 
10% (weight/volume) NaOH in 60% (volume/volume) 
ethanol. After drying, the spots were visualized under a 
long-wave ultraviolet lamp (366 nm). 

The results of the electrophoresis of the products 
produced by adding the cell-free extracts of B6- 
inoculated seedlings to lysopine dehydrogenase reac- 
tion medium are shown in FIG. 2A. In that figure, lane 
1 contains a sample of the reaction mixture produced by- 
adding a portion of the cell-free extract of ten B6- 
inoculated seejdlings to an equal volume of the lysopine 
dehydrogenase reaction medium at time zero, lane 2 
contains the product produced by incubating a portion 
of the cell-free extract of ten B6-inoculated seedlings 
with an equal ^volume of lysopine dehydrogenase reac- 
tion medium for fifteen hours, lane 3 contains the syn- 
thetic octopine, lane 4 contains the product produced 
by adding a portion of the cell-free extract of ten saline- 
inoculated seedlings to an equal volume of lysopine 
dehydrogenase reaction medium at time zero, lane 5 
contains the product produced by incubating a portion 
of the cell-free extract of ten saline-inoculated seedlings 
with an equal volume of lysopine dehydrogenase reac- 
tion medium for fifteen hours, and lane 6 contains the 
synthetic nopaline. 

The results shown in FIG. 2 A demonstrate that octo- 
pine production is. caused by cell -free extracts of B6- 
inoculated corn seedlings, lane 2, but that no such pro- 
duction occurs in the saline control, lane 5. Further- 
more, the amount of octopine produced, as measured by 
an increase in phenanthrenequinone fluorescence, in- 
creases in proportion to the time of incubation. While 
no octopine can be detected at time zero, lane 1, it is 
clearly present after fifteen hours of incubation, lane 2. 
Such results are in accord with the proposition that the 
reaction is enzyme catalyzed and that the enzyme ex- 
tracted from the B6-infected corn seedlings is lysopine 
dehydrogenase. Since only transformed plant tissues are 
known to express the opine synthase genes, these results 
are also in accord with the proposition that the corn 
seedlings have been transformed by infection with the 
vir + A. tumefaciens strain B6. 

2. Assay For Substrate Specificity: Lysopine dehy- 
drogenase catalyzes the synthesis of octopine but not of 
nopaline. In FIG. 2B the results of the electrophoresis 
of the products produced by adding extracts of seed- 
lings to a reaction medium containing reagents which 
allow for the detection of nopaline dehydrogenase en- 
zyme activity are presented. The nopaline dehydro- 
genase reaction medium consists of 60 mM L-arginine. 
60 mM a-ketoglutarate and 16 mM NADH dissolved in 
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0.2M sodium phosphate buffer, pH 7.0. The a-ketogluta- 
rate can be used as a substrate only by n opaline dehy- 
drogenase. Lanes 1-6 in FIG. 2B are the same as lanes 
1-6 of FIG. 2A except that the reaction medium is the 
nopaline dehydrogenase reaction rrtedium. Thus, FIG. 5 
2B shows that cell-free extracts of seedlings inoculated 
with strain B6 cannot use a-ketoglutarate in the conden- 
sation reaction with arginine to produce nopaline. This 
substrate specificity confirms the identity of the enzyme 
produced by seedlings transformed by strain B6 as lyso- 10 
pine dehydrogenase. 

3. Transformation Efficiency: To address this issue, 
assays of single seedlings were performed, and the num- 
ber of cell-free extracts of single seedlings which pro- 
duced octopine were determined. The results are shown 
in FIG. 3A where all ten lanes contain the product 
produced by incubating the cell-free extracts from sin- 
gle B6-inoculated seedlings with lysopine dehydro- 
genase reaction medium for four hours. As shown in 2Q 
FIG. 3A, eight of the ten lanes have a spot which stains 
with phenanthrenequinone and co-migrates with the 
octopine standard. Thus, the transformation frequency 
for this experiment was 80%. 

4. Controls: To rule out the possibility that the octo- 25 
pine produced was a consequence of some secondary 
and uninteresting event, several additional controls 
were done. As shown in FIG. 4A, electrophoresis of 
cell-free sonicates of a suspension of B6 cultured for 48 
hours, as described above in part A, failed to disclose 30 
any stored octopine (lanes 1 and 2). Furthermore, when 
these sonicates were mixed with lysopine dehydro- 
genase reaction medium and incubated for four hours, 
no lysopine dehydrogenase activity could be detected. 
See FIG. 4B, lanes 1, 2 and 3 where the products of this 35 
incubation were electrophoresed. Thus, lysopine dehy- 
drogenase activity is not found in bacterial cultures 48 
hours old. Also, lysopine dehydrogenase reaction me- 
dium alone did not contain any octopine. See FIG. 4B, 
lane 4, where this reaction medium alone was electro- 40 
phoresed. Similarly, no evidence is found for the exis- 
tence of this dehydrogenase in uninfected corn seed- 
lings. See FIG. 4C. which shows an electrophoretogram 

in which lanes 1-5 contain the product produced by 
incubating cell-free extracts of uninfected single seed- 45 
lings for four hours with lysopine dehydrogenase reac- 
tion medium. These results confirm that the presence of 
lysopine dehydrogenase activity in the cell-free extracts 
of B6-inoculated seedlings is due 10 the transformation 
of the seedlings and not to some secondary or uninter- 50 
esting event. 

EXAMPLE II 

A. Preparation of Bacteria 

y 55 
A single colony of the A. tumefaciens strain C58 was 
inoculated into YEB, and the bacteria were incubated as 
described above in Example I, part A, for strain B6. The 
C58 strain is a standard wild type strain of A. tumefaci- 
ens. It is vir + , and it codes for the production of nopa- 60 
line dehydrogenase in suitable plant hosts. It was ob- 
tained from James and Barbara Lippincott, Northwest- 
ern University, Evanston, 111., or from Clarence Kado, 
University of California, Department of Plant Pathol- 
ogy, Davis, Calif. It has been described in Depicker et 65 
al, Plasmid, 3, 193 (1980) and Kao, et al, Molec. Gen. 
Genet., 188, 425 (1982). Strain C58 is also on deposit at 
the ATCC and has been given accession number 33970. 
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B. Transformation of Corn 

Seeds of the inbred yellow Iochief strain of corn were 
sterilized, germinated, inoculated and further incubated 
for 7-14 days as described above in Example I, parts B 
and C, except that the corn seedlings were inoculated 
with strain C58 rather than strain B6. 

C. Assays 

1. Assay For Enzyme Activity in Seedlings: At the 
end of the 7-14 day incubation period, cell-free extracts 
of the seedlings were prepared and were assayed for 
enzymatic activity as described above in Example I, 
part D, except that the reaction medium used was the 
one designed to assay for nopaline dehydrogenase activ- 
ity. As set forth above in Example I, part D, this reac- 
tion medium consisted of 60 mM L-arginine, 60 mM 
a-ketoglutaric acid and 16 mM NADH dissolved in 
0.2M sodium phosphate buffer, pH 7.0. 

The results are shown in FIG. 2B. Lane 8 in FIG. 2B 
contains the product produced by mixing a portion of 
the cell-free extract of ten C58-inoculated seedlings 
with an equal volume of nopaline dehydrogenase reac- 
tion medium at time zero, and lane 7 contains the prod- 
uct produced by incubating a portion of the cell-free 
extract of ten C58-inoculated seedlings with an equal 
volume of nopaline dehydrogenase reaction medium for 
fifteen hours. Thus, FIG. 2B shows that nopaline is 
produced by the cell-free extracts of the C58-inoculated 
corn seedlings, lane 7, but that no such production was 
produced by the saline control, lane 5. Once again, the 
amount of nopaline produced, as measured by an in- 
crease in phenanthrenequinone fluorescence, increases 
in proportion to the time of incubation. While no nopa- 
line can be detected in a reaction mixture at time zero, 
lane 8, it is clearly present after fifteen hours of incuba- 
tion, lane 7, and such results are in accord with the 
proposition that the reaction is enzyme-catalyzed and 
that the enzyme extracted from C58-infected seedlings 
is nopaline dehydrogenase. Since only transformed 
plant tissues are known to express the opine synthase 
genes, these results are also in accord with the proposi- 
tion that the corn seedlings have been transformed by 
infection with the vir+ A. tumefaciens strain C58. 

2. Pyronopaline Assay: The synthesis of nopaline by 
extracts from seedlings transformed with C58 has been 
confirmed by criteria other than electrophoretic mobil- 
ity. When nopaline is eluted from a paper chromato- 
gram with water, evaporated by the application of a 
vacuum to reduce the volume and reacted with an equal 
volume of hot (100° C.) 2M acetic acid for one hour, 
pyronopaline is formed. The conversion reaction is 
diagnostic for nopaline and no other opine. 

In FIG. 2C, lane 1 is pyronopaline produced from 
synthetic nopaline by treating the synthetic nopaline 
with hot 2M acetic acid as described above, lane 2 is 
synthetic nopaline (some of which converts spontane- 
ously to pyronopaline), lane 3 is the product produced 
by incubating a portion of the cell-free extract from ten 
C58-inoculated seedlings with an equal volume of the 
nopaline dehydrogenase reaction medium for fifteen 
hours, and lane 4 is this product treated with hot 2M 
acetic acid as described above. As can be seen, the 
product produced by incubating the cell-free extract 
from C58-inoculated seedlings with the nopaline dehy- 
drogenase reaction mixture, lane 3, is totally converted 
to pyronopaline, lane 4, confirming that C58-inoculated 
seedlings produce nopaline dehydrogenase. 
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3. Catabolism of Nopaline: Finally, if nopaline is incu- 
bated with C58 and serves as its sole energy source, the 
bacteria will grow as this compound is degraded. How- 
ever, B6 grown on a medium containing nopaline as the 
sole energy source will not break down the nopaline 
and will not divide since B6 lacks the specific opine 
oxidase which is necessary for the catabolism of nopa- 
line. 

An assay based on this principle was used to confirm 
the nopaline identity of the product produced by the 
cell-free extracts of C58-inoculated seedlings. The re- 
sults of this assay are shown in FIG. 2D where lane 1 
contains synthetic nopaline, lane 2 contains the product 
produced by incubating a portion of the cell-free extract 
of ten C58-inoculated seedlings with an equal volume of 
the nopaline dehydrogenase reaction medium for fifteen 
hours which has been electrophoresed, eluted with 
water and evaporated, as described above, in connec- 
tion with the pyronopaline assay, and incubated with 
strain B6 for twenty-four hours, and lane 3 contains the 
product produced by incubating a portion of the cell- 
free extract of ten C58-inoculated seedlings with an 
equal volume of nopaline dehydrogenase reaction me- 
dium for fifteen hours which has been electrophoresed, 
eluted, evaporated and incubated with strain C58 for 
twenty-four hours. The product produced by the cell- 
free extracts of the C58-inoculated seedlings was con- 
sumed by strain C58, lane 3, but was not consumed by 
strain B6, lane 2, confirming that the product is nopa- 
line. 

4. Efficiency of. Transformation: The efficiency of the 
transformation of inbred yellow Iochief corn using A. 
tumefaciens strain C58 was also investigated. The results 
are shown in FIG. 3G where all ten lanes contain the 
product produced by incubating cell-free extracts from 
single C58-inoculated seedlings with nopaline dehydro- 
genase reaction medium for six hours. As can be seen, 9 
out of 10 seedlings were transformed. Additional single 
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of the seedlings and not to some secondary and uninter- 
esting event. 

EXAMPLE III 

A. Transformation of Corn 

Strain C58 was cultured as described above in Exam- 
ple II, part A, and inbred yellow Iochief corn was steril- 
ized, germinated, inoculated and incubated as described 
above in Example II, part B. After 7 days of incubation, 
the infected seedlings were planted in pots containing 
potting soil. 

B. Assays 

1. Assay For Enzymatic Activity in Leaves of Em- 
bryonic Origin: Three weeks after planting, three leaves 
of embryonic origin from three separate plants were 
assayed for the presence of nopaline dehydrogenase 
activity. The leaves chosen for the assay were the first 
leaves from the base of the plant which had not en- 
larged. The embryonic leaves are derived from differ- 
entiated structures present in the seedlings at the time of 
inoculation, but these differentiated structures are not 
located in the inoculated area. 

To perform the assay, the three embryonic leaves 
were individually homogenized in Tris-HCl buffer, 
centrifuged and assayed for enzymatic activity as de- 
scribed above in Example II, part C, for the seedlings. 
The results of the electrophoresis of the product pro- 
duced by incubating the cell-free extracts of the three 
embryonic leaves with nopaline dehydrogenase reac- 
tion medium for twelve hours are shown in FIG. 5A 
where lanes 1, 2 and 3 contain the products of this incu- 
bation. As can be seen from FIG. 5A, none of the cell- 
free extracts of the embryonic leaves contained nopa- 
line dehydrogenase activity demonstrating that these 
leaves had not been transformed by the inoculation 
procedure. 

2. Assay For Enzymatic Activity in Leaves of Meri- 



seedling assays were done using seedlings transformed $q stematic Origin: Leaves derived from the meristem (all 

with either B6 or C58. Of the 150 total single seedling leaves besides embryonic leaves) were assayed for the 

assays done with either strain B6 or C58, sixty percent presence of nopaline dehydrogenase 7 weeks after the 

were transformed. planting of the seedlings. The meristem is tissue com- 

5. Controls: To rule out the possibility that the nopa- posed of small, rapidly dividing, undifferentiated cells 

line produced was a consequence of some secondary 45 which are capable of dividing to produce organs and 



and uninteresting event, several additional controls 
were done. As shown in FIG. 4A, electrophoresis of 
cell-free sonicates of a suspension of C58 cultured for 48 
hours, as described above in Example II, part A, failed 
to disclose any stored nopaline, lanes 3 and 4. Further- 
more, when these sonicates were mixed with nopaline 
dehydrogenase reaction medium and incubated for four 
hours at 25" C, no nopaline dehydrogenase activity 
could be detected. See FIG. 4B, lanes 6 and 7 where the 
products of this incubation were electrophoresed. Thus, 55 
nopaline dehydrogenase activity was not found in bac- 
terial cultures 48 hours old. Also, nopaline dehydro- 
genase reaction medium alone did not contain any nopa- 
line. See FIG. 4B, lane 5, where this reaction medium 
alone was electrophoresed. Similarly, no evidence is 
found of this dehydrogenase in uninfected corn seed- 
lings. See^lG. 4C which shows an electrophoretogram 
in which lanes 6-10 contain the product produced by 
incubating cell-free extracts of uninfected single seed- 
lings for four hours with nopaline dehydrogenase reac- 65 
lion medium. These results confirm that the presence of 
nopaline dehydrogenase activity in the cell-free extracts 
of C58-inoculated seedlings is due to the transformation 



other differentiated tissue. Meristematic tissue which 
differentiates into leaves is located in the inoculated 
area. 

To perform the assay, sections were dissected out of 
50 the meristem-deriyed leaves, and the dissected sections 
from each leaf were homogenized together in Tris-HCl 
buffer, centrifuged and assayed for enzymatic activity 
as described above in Example II, part C, for seedlings. 
The sections of the leaves which were used for the assay 
are indicated by the numerals 2 and 3 in FIG. 5B. The 
section designated by the numeral 3 was dissected by 
cutting along lines 6 and 7. Line 7 is coincident with the 
midrib 4 of the leaf. Section 3 is located at the base of 
the leaf which is normally attached to the plant. The 
60 base of the leaf is the growing end of the leaf, and this 
area contains the newest cells on the leaf. The sections 
designated by the numeral 2 were dissected by cutting 
along line 5 which is perpendicular to the midrib 4. 
Sections 2, 2 are at the tip of the leaf, and they contain 
the oldest cells on the leaf. Each section 2 or 3 consti- 
tutes about 1/6 of the leafs surface area. 

The results of the electrophoresis of the product pro- 
duced by incubating the cell-free extracts of these sec- 
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tions of four leaves of meristematic origin taken from 
four separate plants with nopaline dehydrogenase reac- 
tion medium are shown in FIG. 5C. In FIG. 5C, lane 1 
contains nopaline dehydrogenase reaction medium, and 
lanes 2-5 contain the products produced by incubating 5 
the cell-free extracts of the leaves with nopaline dehy- 
drogenase medium for twelve hours. As shown there, 
cell-free extracts of 3 out of 4 leaves produced nopaline 
demonstrating that they contained nopaline dehydro- ]0 
genase activity. Since these leaves are derived from 
meristematic tissue by cell division and differentiation, 
these results demonstrate that the cells in the inoculated 
area of the seedling were able to pass on the ability to 
synthesize nopaline dehydrogenase to future genera- 15 
tions of corn cells. Thus, these results demonstrate that 
transformation of cells in the area inoculated and of 
cells derived from these cells has taken place. 

3. Assay For Enzymatic Activity in Pollen: Sixty 
days after the planting of the seedlings, samples of the 20 
pollen of two plants were individually assayed for the 
presence of nopaline dehydrogenase. To perform the 
assay, 0.5 to 1.0 milliliter of pollen containing approxi- 
mately 5-10X 10 5 grains of pollen was homogenized in 25 
Tris-HC) buffer, centrifuged and assayed for enzymatic 
activity as described, above in Example II, part C, for 
seedlings. The results of the electrophoresis of the prod- 
ucts produced by incubating the cell-free extracts of the 
pollen from the two plants with nopaline dehydro- 30 
genase reaction medium for twelve hours are shown in 
FIG. 5D where lanes 2 and 3 contain the products of 
this incubation, and lane 1 contains nopaline dehydro- 
genase reaction medium. As can be seen from that fig- 
ure, pollen from . both of the plants contained nopaline 35 
dehydrogenase activity. Since the pollen is derived 
from the apical meristem by cell division and differenti- 
ation, these results, like the results for the leaves of 
meristematic origin above, demonstrate that transfor- 
mation has taken place. 

4. Assay for Enzymatic Activity in Seedlings De- 
rived from Transformed Pollen: Pollen from the two 
transformed plants identified above in part B3 is used to 
fertilize ears on plants grown from uninfected yellow 45 
Iochief corn seed. The Fi seeds produced by the fertil- 
ized plants as a result of this mating are harvested and 
are germinated and incubated as described above in 
Example 1, parts B and C, except that the seedlings are 
not inoculated. After 7-14 days of incubation, the seed- 50 
lings are assayed for enzymatic activity as described 
above in Example II, part C. Cell-free extracts of the 
seedlings are found to produce nopaline showing that 
this Fi generation of seedlings is transformed. 

5. Assay for Enzymatic Activity in Leaves of Embry- 55 
onic Origin Taken from Plants Derived From Trans- 
formed Pollen: Seeds produced by the mating described 
above in part B4 of this example are harvested and are 
germinated and incubated as described above in Exam- 
ple I, parts B and C, except that the seedlings are not 
inoculated. After 7-14 days incubation, the seedlings 
are planted as described above in part A of this example. 
Three weeks after planting, embryonic leaves are as- 
sayed for nopaline dehydrogenase activity as described 65 
above in part Bl of this example, and cell-free extracts 
of the embryonic leaves produce nopaline showing they 
are transformed. 
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EXAMPLE IV 
A. Transformation of Corn 

Yellow Iochief corn was sterilized, germinated, inoc- 
ulated and further incubated for 7-14 days as described 
above in Example I, Parts B and C, except that separate 
groups of corn seedlings were inoculated with either A. 
tumefaciens strain A 348, strain JK 195 or strain 238MX. 

The A348 strain carries the broad host range plasmid 
pTiA6NC from strain A6NC It is vir+, and it codes for 
the production of lysopine dehydrogenase in suitable 
plant hosts. Strain A6NC and plasmid pTiA6NC are 
described by Sciaky et al. in Plasmid, 1, 238 (1977). The 
A348 used was obtained from Eugene Nester, Univer- 
sity of Washington, Department of Microbiology and 
Immunology, Seattle, Wash. It was cultured as de- 
scribed above in Example I, part A, for strain B6. 

Strains JK 195 and 238MX each carry a mutation in 
the critical vir region and are vir~. They cannot, there- 
fore, convey the necessary portion of the Ti plasmid to 
their respective hosts. Consequently, plant extracts 
made from material inoculated with these bacteria 
would not be expected to produce any opine when 
added to the appropriate reaction medium. 

The 238MX is similar in background and source to 
the A348 strain, but has the bacterial transposon Tn3 
inserted in the vir .region rendering it vir~. Strain 
238MX was obtained from Eugene Nester (address 
given above). It was incubated as described in Example 
I, part A, for strain B6, and it was selected on YEB 
containing 100 ug/milliliter of carbenicilin. 

The JK 195 strain is a vir - mutant derived from C58. 
It has the bacterial transposon Tn5 inserted in comple- 
mentation group VI of the vir region. A detailed de- 
scription of strain JK 195 may be found in Kao et al., 
Mol Gen. Genet. 188, 425 (1982) and Lundguist et al., 
Mol. Gen. Genet. 193, 1 (1984). The JK 195 used was 
obtained from Clarence Kado (address given above). It 
was also incubated as described in Example I, part A, 
and it was selected on YEB containing 50 ug/milliliter 
rifampicin. 

B. Assays 

1. Assay For Enzyme Activity in Seedlings: As is 
shown in FIGS. 3B and 3D, cell-free extracts of seed- 
lings inoculated with the 238MX strain or the JK 195 
strain do not produce opines. In FIG. 3B, lanes 6-10 
contain the product produced by incubating the cell- 
free extracts of 238MX-inoculated single seedlings with 
the lysopine dehydrogenase reaction medium for four 
hours. As can be seen, none of the seedlings was trans- 
formed since no octopine was synthesized by the cell- 
free extracts. In FIG. 3D, all ten lanes contain the prod- 
uct produced by incubating the cell-free extracts of JK 
195-inoculated single seedlings with nopaline dehydro- 
genase reaction medium for six hours. Again, none of 
the seedlings was transformed since no nopaline was 
produced. , 

However, the A 348 strain is competent with respect 
to transformation. In FIG. 3B, lanes 1-5 contain the 
product produced by incubating the cell-free extracts of 
A348-inoculated single seedlings with lysopine dehy- 
drogenase reaction medium for four hours. As can be 
seen, lysopine dehydrogenase was found in the cell-free 
extract of one out of five single seedlings inoculated 
with A348 showing that the seedling was transformed. 
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Thus, only those vir+ A. tumefaciens strains capable 
of transferring T-DNA transform corn seedlings. Those 
which carry mutations in the vir region and which are, 
therefore, transfer minus do not provoke opine synthase 
activity in extracts made from infected plants, 

EXAMPLE V 

A single colony of the A. tumefaciens strain T37 was 
inoculated into YEB, and the bacteria were incubated as 
described above in Example I, Part A, for strain B6. 10 

The T37 strain is a standard wild type strain of A. 
tumefaciens. It is vir+, and it codes for the production of 
nopaline dehydrogenase in suitable plant hosts. It was 
originally obtained from John Kemp, University of 
Wisconsin, Department of Plant Pathology, Madison, 15 
Wis., and can currently be obtained from Anne C. F. 
Graves, University of Toledo, Dept. of Biology, To- 
ledo, Ohio. It has been described in Turgeon et al , 
PNAS, 73, 3562 (1976) and in Sciaky et al , Plasmid, 1, 
238 (1978). 20 

Seeds of the inbred yellow Iochief strain of corn were 
sterilized, germinated, inoculated and further incubated 
for 7-14 days as described above in Example I, parts B 
and C, except that the corn seedlings were inoculated 
with strain T37 rather than strain B6. 25 

At the end of the 7-14 day incubation period, the 
seedlings were assayed for enzyme activity as described 
above in Example II, part C. Using this procedure, 
nopaline production by the cell-free extracts of T37- 
inoculated corn seedlings was demonstrated showing 30 
that the seedlings were transformed. 

EXAMPLE VI 

A single colony of the A. tumefaciens strain C58 was 
inoculated into YEB, and the bacteria were incubated as 35 
described above in Example II, part A. Seeds of the 
inbred PA91 strain of com were sterilized, germinated, 
inoculated and further incubated for 7-14 days as de- 
scribed above in Example I, parts B and C, for the in- 
bred yellow Iochief strain. The PA91 strain is a stan- 40 
dard inbred strain of corn which is commercially avail- 
able. It was obtained from Jean Roberts, Eli Lilly and 
Co., Greenville, Ind. 

At the end of the 7-14 day incubation period, the 
seedlings were assayed as described above in Example 45 
II, part D. As shown in FIG. 7, nopaline production by 
the cell-free extracts was demonstrated showing that 
the PA91 strain of corn had been transformed. In FIG. 
6, lanes 1-10 contain the product produced by incubat- 
ing the cell-free extracts of single C-58-inoculated seed- 50 
lings with nopaline dehydrogenase reaction medium for 
six hours. As can be seen, all ten of the ceil-free extracts 
produced nopaline showing that the seedlings were 
transformed. 



EXAMPLE VII 

A single colony of the A. tumefaciens strain B6 was 
inoculated into a yeast extract broth, and the bacteria 
were incubated as described above in Example I, part 
A. Seeds of the inbred yellow Iochief strain of com 
were sterilized, germinated, inoculated and further in- 
cubated for 7-14 days as described above in Example I, 
parts B and C, except that the seeds were germinated as 
follows. After being sterilized, the seeds were soaked 
for about 12 hours in sterile distilled water. They were 
then incubated on sterile moistened Whatman No. 3 
paper in sterile Petri dishes as described above, but they 
were only incubated for 1.5 to 2.0 days since soaked 
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seeds germinate in a shorter time than do unsoaked 
seeds. 

At the end of the 7-14 day incubation period, the 
seedlings were assayed for enzyme activity as described 
above in Example I, part D. Using this procedure, octo- 
pine production by cell-free extracts of the infected 
seedlings was demonstrated showing that the seedlings 
were transformed. 

EXAMPLE VIII 

A single colony of the A. tumefaciens strain LBA 
4013 was inoculated into YEB, and the bacteria were 
incubated as described above in Example 1, part A, for 
strain B6. The LBA 4013 strain is a mutant strain de- 
rived from A. tumefaciens strain Ach5. LBA 4013 con- 
tains the wild type Ti plasmid pTiAchS which is vir+, 
and LBA 4013 codes for the production of lysopine 
dehydrogenase in suitable plant hosts. LBA 4013 was 
obtained from Clegg Waldron, Eli Lilly and Co., Indi- 
anapolis, Ind. It has been described by Marton et al., in 
Nature, 277, 129 (1979). 

Seeds of the inbred yellow Iochief strain of corn were 
sterilized, germinated, inoculated and further incubated 
for 7-14 days as described above in Example I, parts B 
and C, except that the com seedlings were inoculated 
with strain LBA 4013 rather than strain B6. 

At the end of the 7-14 day incubation period, the 
seedlings were assayed for enzyme activity as described 
above in Example I, part D. Using this procedure, octo- 
pine production by cell-free extracts of LBA 4013- 
transformed seedlings was demonstrated showing that 
the seedlings were transformed. 

EXAMPLE IX 
A. Preparation of Bacteria 

A single colony of the A. tumefaciens strain CA19 
was inoculated into YEB, and the bacteria were incu- 
bated as described ahove in Example I, Part A, for 
strain B6. The CA19 strain is derived from strain LBA 
4013 and contains the pTiAchS plasmid of LBA 4013 
which is vir+ as described above in Example VIII, and 
strain CA19 codes for the production of lysopine dehy- 
drogenase in suitable plant hosts. 

Strain CA19 also contains the micro-Ti plasmid 
pCEL44. Micro-plasmid pCEL44 comprises a con- 
struct consisting of the gene coding for hygromycin 
phosphotransferase (aphlV) inserted between the 5' 
promoter and associated amino terminal region-encod- 
ing sequence of an octopine synthase gerte and the 3' 
terminator sequence of a nopaline synthase gene. This 
construct is assembled between T-DNA border frag- 
ments in broad -host-range vector pKT210. Micro-plas- 
mid pCEL 44 is capable of transforming plant cells and 
rendering them resistant to hygromycin. 

Strain CA19 is prepared as follows. 

1. Culture of Escherichia coli RRlAM15/pCEL30 
and Isolation of Plasmid pCEL30: Plasmid pCEL30 
comprises the right-hand border sequence of the 
T-DNA and 5' end of the octopine synthase (ocs) gene 
derived from plasmid pTiA66. A linker containing a 
unique Bglll site is fused in the 1 1th codon of the ocs 
gene. Attached to the linker are the termination and 
polyadenylation signals of the nopaline synthase gene of 
plasmid pTiC58. Attached to these sequences is a se- 
quence which includes the left-hand border sequence of 
the T-DNA derived from plasmid pTiA66. A restric- 
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tion site and function map of plasmid pC£L30 is given 
in FIG. 8. 

Plasmid pCEL30 can be conventionally isolated from 
Escherichia coli K12 RR 1 AM 1 5/pCEL30. E. coii 
RRlAM15/pCEL30 is on deposit at. the Northern Re- 5 
gional Research Laboratory (NRRL), Peoria, 111. 
61604, and has accession number NRRL B- 159 15. 

The isolation is performed as follows. E. coli 
RRlAM15/pCEL30 is grown in 750 ml of L medium 
(10 g/1 caesin hydrolysate, 5 g/1 yeast extract, 5 g/1 10 
NaCl, 1 g/1 glucose, pH 7.4) containing ampicillin at 50 
mg/ml according to conventional microbiological pro- 
cedures. The culture is harvested after 24 hours incuba- 
tion at 37* C. with vigorous shaking. 

The culture is centrifuged, and the cell pellet is resus- 15 
pended in 50 ml freshly-prepared lysis buffer (50 mM 
Tris-HCl, pH 8, 10 mM EDTA, 9 mg/ml glucose, 2 
mg/ml lysozyme). After 45 minutes incubation on ice, 
the suspension is mixed with 100 ml of a solution that is 
0.2N NaOH and 1 % SDS. The suspension is then kept 20 
on ice for a further 5 minutes. Another 90 ml of 3M 
sodium acetate is added, and the mixture is maintained 
on ice for an additional 60 minutes. 

Cell debris is removed by centrifugation, and the 
supernatant is mixed with 500 ml ethanol. After 2 hours 25 
at —20* C, nucleic acid is pelleted by centrifugation 
and is resuspended in 90 ml of 10 mM Tris-HCl, pH 8, 
10 mM EDTA. 

The nucleic acid solution is mixed with 90 gm cesium 
chloride, and 0.9 ml of a solution containing 10 mg/ml 30 
of ethidium bromide. This mixture is then centrifuged at 
40,000 rpm for 24 hours to purify the plasmid DNA. 
The plasmid DNA band is recovered and is then recen- 
trifuged at 40,000 rpm for 16 hours. The plasmid DNA 
band is again recovered and* freed of cesium chloride 35 
and ethidium bromide by conventional procedures. It is 
next precipitated with 2 volumes of ethanol containing 
90 g/1 ammonium acetate. The pelleted DNA is dis- 
solved in TE buffer (10 mM Tris-HCl, pH 8, 1 mM 
EDTA) at a concentration of 0.2 mg/ml. 40 

2. Culture of E. coli JA221/pOW20 and Isolation of 
Plasmid pOW20: E. coli JA221/pOW20 is grown as 
described for £. coli RRlAM15/pCEL30 in part Al of 
this example, and plasmid pOW20 is prepared as de- 
scribed for plasmid. pCEL30 in part Alof this example. 45 

3. Construction of E. coli RRlAM15/pCEL40: Five 
jig of plasmid pCEL30 DNA are digested with 50 units 
of Bglll restriction enzyme in a 150 /il reaction mixture 
of the composition recommended by the enzyme manu- 
facturer. Restriction and other enzymes can be readily 50 
obtained from the following sources: 

Bethesda Research Laboratories, Inc. Box 6010 
Rockville, Md. 20850 

Boehringer Mannheim Biochemicals 7941 Castleway 
Drive P.O. Box 50816 Indianapolis, Ind. 46250 55 

New England Bio Labs., Inc. 32 Tozer Road Bev- 
erly, Mass. 01915 
Digestion is allowed to proceed for 90 minutes at 37* C. 

The reaction mixture is first mixed with 8.75 u.1 of 
0.5M Tris-HCl, pH 8, 1 mM EDTA and then with 1.25 60 
units of calf intestjnal phosphatase (which can be pur- 
chased from Boehringer Mannheim) and incubated at 
37* C. for 15 minutes. The mixture is next extracted 
with buffered phenol, then with ether and is precipi- 
tated with 2 volumes of ethanol containing ammonium 65 
acetate. After 30 minutes at -70' C, the DNA is pel- 
leted and redissolved in TE buffer at a concentration of 
10 Mg/ml. 
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About 20 jig of plasmid pOW20 DNA is digested 
with the restriction enzymes BamHI and Bglll accord- 
ing to the enzyme manufacturer's recommended proce- 
dures to obtain the aphl V gene. The aphlV gene is an E. 
coli gene which makes plants containing the gene resis- 
tant to hygromycin. 

The DNA fragments resulting from this digestion are 
fractionated by conventional methods of agarose gel 
electrophoresis and isolated by entrapment on a piece of 
NA-45 DEAE paper (Schleicher & Schuell , Inc., 
Keene, N.H. 03431) inserted into the gel during electro- 
phoresis. DNA is eluted from the paper by spinning the 
paper for 5 seconds with a sufficient amount of a high 
salt bufTer (1.0M NaCl; 0.1 mM EDTA; and 20 mM 
Tris, pH 8.0) to cover the paper in a microcentrifuge. 
The paper is incubated at 55-60° C. for 10-45 minutes 
with occasional swirling. The bufTer is removed, and 
the paper washed with about 50 u.1 of buffer. The DNA 
is extracted first with phenol and then with ether and is 
resuspended in TE buffer at a concentration of about 25 
fig/ml. 

Ten ng of the phosphatased, Bglll-cut plasmid 
pCEL30 is mixed with 50 ng of the purified - 1.3 kb 
BamHI-Bglll fragment of plasmid pOW20 in a 15 fil 
ligase buffer (50 mM Tris-HCl, pH 7.6; 10 mM MgCh; 
10 mM DTT; and 1 mM ATP) containing 0.8 units of 
T4 DNA ligase (BRL). The mixture is incubated over- 
night at 15° C. 

The ligation mixture is mixed with 15 u.1 sterile 60 
mM CaCb solution. Next, 70 u.1 of a suspension of com- 
petent £. coli RR1AM15 cells, which has been stored 
20 X concentrated in 30 mM CaCh, 15% glycerol at 
— 70* C, are added. After 60 minutes on ice, the trans- 
formation mixture is heat-treated at 42* C. for 2 minutes 
and is then incubated with 0.5 ml L medium for 90 
minutes at 37" C. 

Samples of the mixture are spread on L medium con- 
taining ampicillin at 50 mg/1 and solidified with agar at 
15 g/1. These samples are then incubated overnight at 
37° C. to permit growth of colonies from transformed 
cells. 

Colonies resulting from the transformation are inocu- 
lated into 5 ml L medium containing ampicillin at 50 
mg/ml and grown overnight at 37° C. Plasmid DNA is 
prepared from 1 ml samples of these cultures by the 
procedure of Holmes & Quigley, Analytical Biochemis- 
try, 114, 193 (1981) and is redissolved in 50 u.1 of TE 
buffer. 

4. Construction of Micro-Ti Plasmid pCEL44: Since 
plasmid pCEL40 is not capable of replication in Agro- 
bacterium, the micro T-DNA of plasmid pCEL40 was 
first transferred, as an EcoRI fragment, into broad-host- 
range vector pKT210. This broad -host -range vector is 
available from Plasmid Reference Center, Stanford 
University, Palo Alto, Calif. 94305. 

Five u-g of plasmid pKT210 are digested with 50 units 
of EcoRI restriction enzyme in a 1 50 u.1 reaction of a 
composition recommended by the enzyme manufac- 
turer. After 90 minutes at 37° C, the reaction is treated 
with calf intestinal phosphatase as described above in 
part A3 of this example and is dissolved in TE buffer at 
a concentration of 10 pig/ml. 

Fifteen jil of a preparation of plasmid pCEL40 DNA, 
grown as described above in pan A3 of this example, 
are digested with 10 units of EcoRI restriction enzyme 
in a 20 u-1 reaction at 37° C. for 90 minutes and are then 
extracted with phenol, followed by extraction with 
ether. The digested DNA is precipitated with 2 vol- 
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umes of ethanol containing ammonium acetate at 
C. and is redissolved in 20 jil TE buffer. 

Ten ng of phosphatased, EcoRI-cut pKT210 are 
ligated with 5 u.1 of EcoRI-cut pCEL40 as described 
above in part A3 of this example, and transformed into 
E. coli RR1 AM 15 as described above in part A3 of this 
example. 

Transformed cells containing pCEL44 are selected 
by their ability to grow on solidified L medium contain- 
ing chloramphenicol at 10 mg/1. A restriction site and 
function map of pCEL44 is provided in FIG. 9. 

5. Conjugation of pCEL44 Into A. tumefaciens 
LBA4013 to form Strain CA19: E. coli K12 
RRlAM15/pCEL44 and E. coli pRK2013 are grown 
overnight at 37* C. on solidified L medium. A. tumefaci- 
ens LBA4013 is grown for 2 days at 28° C. on solidified 
L medium. 

One loop of E. coli K12 RRlAM15/pCEL44, one 
loop of E. coli pRK2013 and one loop of A, tumefaciens 
LBA 4013 are mixed in 1 ml of 30 mM magnesium 
sulfate solution. Next, a drop of the mixture is placed on 
solidified TY medium (5 g/1 caesin hydrolysate, 5 g/1 
yeast extract, 15 g/1 agar) and incubated at 28° C. over- 
night. 

The bacterial growth is resuspended in 3 ml of 10 mM 
magnesium sulfate solution and 0. 1 ml samples of serial 
dilutions are spread on solidified TY medium containing 
100 mg/1 nalidixic acid and 2 mg/1 chloramphenicol 
and incubated at 28° C. 

Transconjugants give rise to individual colonies after 
2 to 4 days growthY' These are inoculated singly into 25 
ml liquid TY medium cpntaining 100 mg/1 nalidixic acid 
and 2 mg/1 chloramphenicol and incubated at 28* C. 
with shaking for another 2 days. The plasmid content of 
the transconjugants is then checked by the method of 35 
Casse et al. {Journal of General Microbiology 
113:229-242; 1979), and strain CA19 containing the 
wild type pTiAchS plasmid and the pCEL44 plasmid is 
isolated. 

The CA19 used to practice the method of the present 
invention was obtained from Clegg Waldron, Eli Lilly 
and Co., Indianapolis, Ind. The preparation of strain 
CA19 has also been described in Waldron et al., Plant 
Molec, Biol., 5, 103 (198,5) which is incorporated herein 
by reference. 

B. Transformation of Corn 

Seeds of the inbred yellow Iochief strain of corn were 
sterilized, germinated, inoculated and further incubated 
for 7-14 days as described above in Example I, parts B 
and C, except that the com seedlings were inoculated 
with strain CA19 rather than strain B6. 
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C. Assays 

1. Assay of Seedlings for Lysopine Dehydrogenase 55 
Activity: At the end of the 7-14 day incubation period, 
the seedlings were assayed for enzyme activity as de- 
scribed above in Example I, part D. The results are 
shown in FIG. 6 where lanes 1-10 contain the product 
produced by incubating cell-free extracts of single 60 
CA19-inoculated seedlings with lysopine dehydro- 
genase for four hours. As can be seen there, octopine 
production in 9 out of 10 cell-free extracts of single 
CA19-inoculated corn; seedlings was demonstrated 
showing that the seedlings contained lysopine dehydro- 65 
genase and were transformed. 

2. Assay of Seedlings for Hygromycin Resistance: 
Some seedlings which were inoculated with strain 



CA19 were incubated only for 3-4 days after inocula- 
tion, at which time they were assayed for resistance to 
hygromycin as follows. The seedlings were dissected 
away from the endosperm and scutellum (see FIGS. 1A 
and IB) and cut into approximately 3 mm cross sections. 
The cross sections were cultured for three weeks on 
Duncan's medium (described by Duncan et al. in Planta. 
165, 322 (1985)) supplemented with 200 ug/ml each 
carbeniciliin (Sigma) and vancomycin (Lilly) or on 
BN4 medium (Murashige and Skoog major and minor 
salts (described in Physiol Plant, 15 473 (1962)), 4 mg/1 
2,4-dichlorophenoxy acetic acid as auxin, 9 g/1 Difco 
Bactoagar and 20 g/l sucrose) supplemented with 200 
fig/ml each carbeniciliin and vancomycin, in the dark, 
at 25° to 27* C, followed by another three-week pas- 
sage on those antibiotics. 

To test the response to hygromycin of the tissue cul- 
tures derived from the corn seedlings, either whole 
cross sections plus the tissue which has grown up from 
the cross sections or 100 mg callus samples are placed 
onto about 50 ml of Duncan's medium or of BN4 me- 
dium supplemented with the aforementioned concentra- 
tions of vancomycin and carbeniciliin and containing 
about 125 fig/ml of hygromycin B (Lilly) contained in 
Falcon 1005 Petri dishes. This test is read after three 
weeks of incubation in the dark at 27° C, by visually 
checking for growth. Cultures that are growing are 
light in color and show an increase in size. Using this 
test, positive growth phenotypes are recovered from 
cultures derived from CA19-inoculated seedlings show- 
ing that the seedlings' are transformed by the heterolo- 
gous hygromycin gene. 

EXAMPLE X 

A. Preparation of an^4. tumefaciens Strain Carrying the 
Gene Conferring Resistant to the Herbicide Glyphosate 

1. Culture of E coli RRlAM15/pCEL30 and Isola- 
tion of Plasmid pCEL30: E. coli RRlAM15/pCEL30 is 
grown as described above in Example IX, part Al, and 
plasmid pCEL30 is isolated as described in Example IX, 
part Al. ; 

2. Bq III* Digestion of Plasmid pCEL30 and Treat- 
ment With Calf Intestinal Phosphatase: Five u,g of plas- 
mid pCEL30 DNA are digested and treated with calf 
intestinal phosphatase as described above in Example 
IX, part A3. . 

3. Isolation of Glysophate-Resistant EPSP Sunthase 
Gene: A gene coding for a glyphosate-resistant 5-enol- 
phyruvylshikimate 3-phosphate synthase gene 
(GREPSPS gene) is isolated as described by Comai et 
al. in Nature, 317, 714 (1985) and by Stalker et al., J. 
Biol. Chem.. 260, 4724 (1985) which are incorporated 
herein by reference. In the final steps of this procedure, 
the GREPSPS gene, in a BamHI fragment cut from 
plasmid pPMG34, is cloned into plasmid pUC7 to give 
plasmid pPMG38. The GREPSPS gene is then excised 
from plasmid pPMG38 as an EcoRI fragment. The 
EcoRI fragment is then modified using a suitable com- 
mercially available molecular linker so that it is able to 
ligate with the unique Bglll site on the Bqlll-digested 
pCEL30 plasmid and so that the EcoRI site is removed. 

The herbicide glyphosate (N-phosphonomethylgly- 
cine) is a widely used broad-spectrum herbicide that 
kills both weed and crop species. It inhibits a metabolic 
step in the biosynthesis of aromatic compounds, and the 
cellular target of glyphosate is 5-enolphyruvyl shiki- 
mate 3-phosphate synthase (EPSP synthase) which 



catalyzes the formation of 5-enolpyruvylshikimate 3- 
phosphate from phosphoenolpyuvate and shikimate, 
and inhibition of this step of the shikimate pathway 
eventually leads to cellular death. The GREPSPS gene 
is a mutant allele of the aroA locus of Salmonella typhi- 5 
murium which encodes a EPSP synthase in which the 
substitution of a serine for proline causes a decreased 
affinity of the enzyme for glyphosate. 

4. Ligation: Ten ng of the phosphatased, Bglll-cut 
plasmid pCEL30, as prepared above in Example IX, 10 
part A3, are mixed with 50 ng of the GREPSPS gene 
(including the attached linker) in a 15 fil ligase buffer 
containing 0.8 units of T4 DNA ligase. The mixture is 
incubated overnight at 15* C. The ligation mixture is 
used to transform competent E. coli RR1AM15 cells as 15 
described in Example IX, part A3. 

5. Construction of Micro-Ti Plasmid Carrying the 
GREPSPS Gene: After selection of transformed cells 
produced as described in part A4 of this example on L 
medium containing ampicillin as described above in 20 
Example IX, part A3, the plasmids from the trans- 
formed £. coli RR 1AM 15 cells are transferred as de- 
scribed above in Example IX, part A4, as an EcoRI 
fragment, into broad- host-range vector pKT210. 

6. Conjugation Into A. tumefaciens LB A 4013: The 25 
plasmid carrying the GREPSPS gene on broad-host- 
range vector pKT210 is transferred into A. tumefaciens 
strain LBA 4013 by conjugation as described above in 
Example IX, part A 5. The resulting transconjugants are 
selected as described above in Example IX, part A 5, 30 
and a new strain of A. tumefaciens carrying the 
GREPSPS gene which is herein referred to as strain 
LBA 4013/GREPSPS, is isolated. 

B. Transformation of Corn 35 

Seeds of the inbred yellow Iochief strain of corn are 
sterilized, germinated, inoculated and further incubated 
as described above in Example I, parts B and C, except 
that the corn seedlings were inoculated with strain LBA 
4013/GREPSPS rather than strain B6. After 7 days of 40 
incubation, the infected seedlings are planted in pots in 
potting soil. 

C. Assays 

1. Assay For Enzymatic Activity in Pollen: Sixty 45 
days after the planting of the seedlings, samples of the 
pollen of five plants are individually assayed for the 
presence of lysopine dehydrogenase as described above 

in Example III, part B. The results of the electrophore- 
sis of the products produced by incubating the cell-free 50 
extracts of the pollen of the five plants show that three 
out of five of the cell-free extracts of the pollen produce 
octopine showing that the pollen is transformed. 

2. Assay for EPSP Synthase Activity: Leaves derived 
from the men stem are assayed seven weeks after plant- 55 
ing of the seedlings for EPSP synthase activity accord- 
ing to the method of Boocock and Coggins, FEBS Let- 
ters, 154, 127 (1983) which is incorporated herein by 
reference. Five leaves from five separate plants are 
assayed, and three are found to contain EPSP synthase 60 
activity showing that transformation had occurred. 

3. Assay For Resistance To Glyphosate: The three 
plants found to contain EPSP synthase activity in their 
leaves as a result of the assay in part C2 of this example 
are sprayed with the equivalent of 0.5 kg/hectare of the 65 
isopropylamine salt of glyphosate. All three plants 
show considerable tolerance to glyphosate as compared 

to controls. 
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None of the foregoing description of the preferred 
embodiments is intended in any way to limit the scope 
of the invention which is set forth in the following 
claims. Those skilled in the art will recognize that many 
modifications, variations and adaptations are possible. 

EXAMPLE XI 

A. Transformation of Corn 

Strain CA19 was cultured as described in Example 
IX, part A, and yellow Iochief corn was sterilized, 
germinated, inoculated with CA19 and inoculated as 
described in Example I, parts B and C. Seedlings were 
also inoculated with A. tumefaciens strain CA17 which 
is identical to strain CA19, except that the gene that 
codes for hygromycin resistance is inserted in the an- 
tisense direction. Finally, seedlings were also inoculated 
with YEB alone. After seven days of incubation, all of 
the inoculated seedlings were planted. 

B. Assays 

1. Assay For Enzymatic Activity in the Upper 
Leaves of Transformed Plants: As the plants reached 
sexual maturity, their upper leaves were assayed for the 
presence of lysopine dehydrogenase activity as de- 
scribed in Example III, part B2, except that lysopine 
dehydrogenase reaction medium was used. Leaves of 
five plants derived from seedlings inoculated with 
CA19, of two plants derived from seedlings inoculated 
with CAP and of three plants inoculated with YEB 
were assayed. 

The results are shown in FIG. 14. In FIG. 14, the 
lanes contain the following materials: 
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' S-l 


YEB 


Flag leaf 
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19-6 


CA19 


Leaf below 










flag leaf 
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19-6 


CA19 


Tiller 
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17-4 


CA17 


Ear shoot 
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CA19 


Ear shoot 
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Leaf below 










flag leaf 
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Leaf below 










flag leaf 
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Leaf below 










flag leaf 
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As shown in FIG. 14, none of the extracts of leaves 
derived from YEB-inoculated seedlings produced octo- 
pine, whereas 8 out of 12 extracts of leaves derived from 
CA19-inoculated and CA-17-inoculated seedlings pro- 
duced octopine. 
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2. Assay For Bacteria In Upper Leaves of Trans- 
formed Plants: Aliquots of the extracts of the leaves 
used in the lysopine dehydrogenase assay described in 
part B2 of this example were also plated to determine if 
any bacteria were present in these extracts. Any bacte- 5 
rial colonies growing up as a result of these platings 
were transferred to a lactose-containing medium diag- 
nostic for Agrobacteruim. None of the bacteria that 
grew up as a result of the original platings (either in 
extracts of leaves derived from YEB-inoculated seed- io 
lings, from CA19-inoculated seedlings; or CA17- 
inoculated seedlings) oxidized lactose to '! lactic acid 
showing that none of them were Agrobacterium of the 
type used to inoculate the seedlings. : 

3. Assay For Enzymatic Activity in Leaves of Fi 15 
Plants: Using the results of the above assay; plants 19-5 
and 19-3 were chosen for further study. Plant 19-5 was 
chosen because extracts of both of the two top leaves 
were positive for lysopine dehydrogenase activity indi- 
cating that this plant might have a transformed sector 20 
extending into the tassel. Plant 19-3 was chosen because 
extracts of its ear shoot and leaf below the flag leaf were 
positive for lysopine dehydrogenase activity indicating 
that this plant might have a transformed sector extend- 
ing into the ear and a transformed sector extending into 25 
the tassel. 

These two plants were self pollinated. Then, 26-27 
days post pollination, the immature progeny ears of 
plants 19-3 and 19-5 were removed from the plants and 
surface sterilized, The late maturity embryos of the ears 30 
from these plants were excised and planted on half- 
strength Murashige'and Skoog medium. The embryos 
were allowed to germinate sterilely in the light for 8-10 
days, at which time they were planted in soil. 

The meristem-derived leaves of the Fi seedlings 35 
which survived transplanting to soil were assayed for 
lysopine dehydrogenase activity as described in Exam- 
ple III, part B2, except that lysopine dehydrogenase 
reaction medium was used. The intensity of the staining 
of the spots on the electrophoretogram that co- 40 
migrated with octopine was rated. The results are pres- 
ented in Table 1. As shown there, some of the leaves of 
the seedlings produced octopine, showing the sexual 
transmission of this trait to the Fj generation. 
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TRANSFORMATION OF OTHER SPECIES OF *° 
^ GRAMINEAE 

EXAMPLE XII 

A single colony of the A. tumefaciens strain B6 was 
inoculated into a yeast extract broth, and the bacteria 65 
were incubated as described above in Example I, pan 
A. Seeds of rye were sterilized, germinated, inoculated 
and further incubated for 7-14 days as described above 
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in Example I, parts B and C. The seedlings were inocu- 
lated in the apical meristem, an area of rapidly dividing 
cells that gives rise to the germ line cells. 

At the end of the 7-14-day incubation period, the 
seedlings were assayed for enzyme activity as described 
above in Example I, part D. The results of the electro- 
phoresis of the products produced by adding the cell- 
free extracts of the B6-inoculated rye seedlings to lyso- 
pine dehydrogenase reaction medium are shown in 
FIG. 10. In that figure, lane 1 contains the synthetic 
octopine standard and lanes 2-6 contain the product 
produced by incubating the cell-free extract of single 
B6-inoculated rye seedlings with lysopine dehydro- 
genase reaction medium. The results shown in FIG. 10 
demonstrate that octopine production is caused by the 
cell-free extracts of three out of four B-6-inoculated rye 
seedlings tested (lanes 2, 3, 4 and 6). These results show- 
that the rye seedlings have been transformed by infec- 
tion with the vir+ A. tumefaciens strain B6. 

EXAMPLE XIII 

A single colony of the A. tumefaciens strain B6 was 
inoculated into a yeast extract broth, and the bacteria 
were incubated as described above in Example I, pan 
A. Barley seeds were sterilized, germinated, inoculated 
and further incubated for 7-14 days as described above 
in Example I, parts B and C. The seedlings were inocu- 
lated in the apical meristem, an area of rapidly dividing 
cells that gives rise to the germ line cells. 

At the end of the 7-14-day incubation period, the 
seedlings were assayed for enzyme activity as described 
above in Example 1, part D. The results of the electro- 
phoresis of the products produced by adding the cell- 
free extracts of B6-inoculated barley seedlings to lyso- 
pine dehydrogenase reaction medium are shown in 
FIG. 11. In that figure, lane 1 contains the synthetic 
octopine standard and lanes 2-6 contain the product 
produced by incubating the cell-free extract of single 
B6-inoculated barley seedlings with lysopine dehydro- 
genase reaction medium. The results shown in FIG. 11 
demonstrate that octopine production is caused by five 
out of five cell-free extracts of B6-inoculated barley 
seedlings tested and show that the seedlings have been 
transformed. 

EXAMPLE XIV 

A single colony of the A. tumefaciens strain C58 was 
inoculated into YEB, and the bacteria were incubated as 
described above in Example II, part A. Oat seeds were 
sterilized, germinated, inoculated and further incubated 
for 7-14 days as described above in Example I, pans B 
and C. The seedlings were inoculated in the apical meri- 
stem, an area of rapidly dividing cells that gives rise to 
the germ line cells. 

At the end of the 7-14-day incubation period, the 
seedlings were assayed as described above in Example 
II, part D. The results are shown in FIG. 12. In FIG. 12, 
lane 1 contains nopaline dehydrogenase reaction me- 
dium alone, lane 2 contains the nopaline standard, lanes 
3-11 contain the product produced by incubating the 
cell-free extracts of single C58-inoculated oat seedlings 
with nopaline dehydrogenase reaction medium and lane 
12 contains synthetic octopine. As can be seen, six out 
of nine of the cell-free extracts of single C53-inoculated 
oat seedlings produced nopaline showing that the seed- 
lings were transformed. 
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EXAMPLE XV 

A single colony of the A. tumefaciens strain C58 was 
inoculated into YEB, and the bacteria were incubated as 
described above in Example II, part A. Wheat seeds 5 
were sterilized, germinated, inoculated and further in- 
cubated for 7-14 days as described above in Example I, 
parts B and C. The seedlings were inoculated in the 
apical meristem, an area of rapidly dividing cells that 
gives rise to the germ line cells. 10 

At the end of the 7-14-day incubation period, the 
seedlings were assayed as described above in Example 
II, part D. The results are shown in FIG. 13. In FIG. 13, 
lane 1 contains the nopaline standard, lanes 2-6 contain 
the product produced by incubating the cell-free ex- 15 
tracts of single C58-inoculated wheat seedlings with 
nopaline dehydrogenase reaction medium, lane 7 con- 
tains synthetic octopine, lane 8 contains a mixture of 
synthetic octopine and lysopine dehydrogenase reac- 
tion medium and lanes 9-15 contain the product pro- 20 
duced by incubating the cell-free extracts of single C58- 
inoculated wheat seedlings with lysopine dehydro- 
genase reaction medium. As can be seen, three out of 
five of the cell-free extracts of the C58-inoculated 
wheat seedlings produced nopaline when incubated 25 
with nopaline dehydrogenase reaction medium, show- 
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ing that the seedlings were transformed. None of the 
cell-free extracts produced octopine when incubated 
with lysopine dehydrogenase reaction medium. 
We claim: 

1. A method of producing transformed Gramineae, 
said method comprising: 

making a wound in a graminaceous seedling with 
newly emerged radicle and stem, said wound being 
made in an area of the seedling containing rapidly 
dividing cells, wherein said area extends from the 
base of the scutellar node to slightly beyond the 
coleoptile node; and 

inoculating the wound with vir+ Agrobacterium tu- 
mefaciens. 

2. The method of claim 1 wherein about four wounds 
are made in the seedling, and a total of about 10 8 Agro- 
bacterium tumefaciens cells are used to inoculate the 
wounds. 

3. The method of claim 1 wherein the vir + Agrobacte- 
rium tumefaciens contains a vector comprising geneti- 
cally-engineered T-DNA. 

4. The method of claim 1 or 3 wherein the Gramineae 

is selected from the group consisting of corn, wheat, 

rye, barley and oats. 

***** 
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a partially inbred initial population, the results were similar to 
those observed (Table 1). 

With the asymmetrical overdominance model, populations at 
equilibrium under high seifing do not maintain variation, but 
will fix the allele least disadvantageous in homozygotes though 
an increase will occur if the set of lines has not reached equili- 
brium, so that some still remain with the lower fitness allele. 
One would thus expect no increase in fitness on intercrossing 
the inbred lines after the population reaches equilibrium. During 
inbreeding, the fitness values behaved similarly to those in the 
mutational model, with a slight decrease, rather than increase, 
in fitness (Fig. 2c; Table 1). But the intercross mean fitness, 
although higher than the inbred fitness, was never higher than 
the fitness of the initial population, unlike the mutational load 
model and the observed data (Table 1). 

These results support the partial dominance (mutational). 
hypothesis 2,7 for genetic load, and suggest that the mutation 
rate per genome in £. paniculata must be very high (of the order 
of one per generation). This is consistent with most other avail- 
able data 15,16 . The failure of.fitness to recover during inbreeding, 
and intercrossed fitness levels higher than those of the original 
outbred strains, are also seen in maize 17 and mice 9 (Table 1). 
This suggests that purging of partially recessive mutations 
accounts for an important component of heterosis. When the 
fitness values under inbreeding are plotted against the inbreeding 
coefficient, the partial dominance model yields an increase with 
high F values, due to purging, but the overdominance model 
predicts a monotonic decline. The partial dominance model can 
thus also explain the observed patterns in inbreeding experi- 
ments with conifers 18 . 

The occurrence of natural selection during the inbreeding 
experiment, consistent with purging, was indicated by excess 
heterozygosity of five (presumably neutral) allozyme marker 
loci scored in the experimental plants, each generation of 
inbreeding (data not shown). Under inbreeding, heterozygotes 
tend to be produced by outcrosses. Surviving adults in the 
progeny generation therefore tend to be heterozygotes 19,20 , and 
excess heterozygosity compared with the neutral expectation is 
frequently found in inbreeding plants 21 . 

These results are also relevant to populations undergoing 
disturbance by humans. Inbreeding of formerly outbred popula- 
tions occurs in many zoo populations, as well as domesticated 
populations and populations subjected to severe reductions in 
size as a result of human alterations of the environment. Extreme 
inbreeding has been recommended to purge genetic load and 
force the adaptation of endangered populations to the inbreed- 
ing regime they will experience under human management 22 . 
This assumes that lowered fitness inevitably caused by increased 
homozygosity during this process will not be too severe or 
prolonged. The validity of this assumption depends on the 
severity and dominance of the mutant alleles in the population 
being inbred 23,5 . Our results indicate that purging by the most 
severe inbreeding, self-fertilization, could decrease fitness con- 
siderably, with little recovery under inbreeding, and that fitness 
is restored only when the inbred lines are intercrossed. □ 
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Transgenic plant aequorin reports 
the effects of touch and 
cold-shock and elicitors on 
cytoplasmic calcium 

Marc R. Knight*, Anthony K. Campbellt, 
Steven M. Smith* & Anthony J. Trewavas* 

* Institute of Cell and Molecular Biology, University of Edinburgh, 
Edinburgh EH9 3JH, UK 

t Department of Medical Biochemistry, University of Wales College of 
Medicine, Cardiff CF4 4XN, UK 

Methods for measuring plant cytoplasmic calcium using micro- 
electrodes or microinjected fluorescent dyes are associated with 
extensive technical problems, so measurements have been limited 
to single or small groups of cells in tissue strips or protoplasts 1,2 . 
Aequorin is a calcium-sensitive luminescent protein 3 from the 
coelenterate Aequorea victoria (A.forskalea) which is formed from 
apoaequorin, a polypeptide of relative molecular mass —22,000, 
and coelenterazine, a hydrophobic luminophore 4 . Microinjected 
aequorin has been widely used for intracellular calcium measure- 
ment in animal cells 4 , but its use in plants has been limited to 
exceptionally large cells 5 . We show here that aequorin can be 
reconstituted in transformed plants and that it reports calcium 
changes induced by touch, cold-shock and fungal elicitors. Recon- 
stituted aequorin is cytoplasmic and nonperturbing; measurements 
can be made on whole plants and a calcium indicator can be 
constituted in every viable cell. Now that apoaequorin can be 
targeted to specific organelles, cells and tissues, with the range of 
coelenterazines with differing calcium sensitivities and properties 
available 6 , this new method could be valuable for determining the 
role of calcium in intracellular signalling processes in plants. 

The apoaequorin-coding region from complementary DNA 
clone pAEQl (ref. 7) was fused to the cauliflower mosaic virus 
(CMV) 35S promoter 8 and transferred to Nicotiana plum- 
baginifoiia using the Agrobacterium tumefaciens pBIN19 binary 
vector system 9 to provide constitutive expression (Fig. la). 
Expression was detected by western blots, using antibodies 
raised against aequorin purified from Aequorea 10 , in all plants 
transformed with the 35S-apoaequorin chimaeric gene (Fig. Id). 
F t progeny from the transformant expressing the highest levels 
of apoaequorin (MAQ2.4; Fig. 16) were selfed and homozygous 
F 2 progeny were used for subsequent experiments. 

The time course for aequorin reconstitution from apoaequorin 
in whole seedlings, in seedling homogenates and from 
apoaequorin purified from Aequorea, all show similar kinetics 
(Fig. 2). Seedlings incubated in coelenterazine were transferred 
back to agar and continued to grow vigorously and respond 
tropically to light and gravity. No signs of toxicity were detected 
and growth rates of coelenterazine-treated and untreated seed- 
lings were similar. The ratio of reconstituted aequorin in coty- 
ledons, shoots and roots was 1 : 1 : 0.25 on a fresh-weight basis. 
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CMV 35S promoter poly (A) 



RG. 1. Construction of chimaeric gene used to 
express apoaequorin in Nicotiana. b, Expression 
of apoaequorin in Nicotiana Western blot analysis 
of total protein from the F x progeny of separate 
transformants. Lane 1, total protein from £ coli 
expressing apoaequorin 19 ; lanes 2-7, separate 
Nicotiana 35S-apoaequorin transformants, 
MAQ3.3. MAQ3.2, MAQ2.13, MAQ2.4, MAQ2.3 and 
MAQ2.1; lane 8, wild-type untransformed 
Nicotiana. 

METHODS. The apoaequorin coding region was 
amplified by polymerase chain reaction (PCR) from 
cDNA clone pAEQl (ref. 7) (the kind gift of Milton 
Cormier, University of Georgia), before construction 
of this chimaeric gene, to remove homoploymeric 
dC-dG tails present at either end of the cDNA 
insert. This counteracted inhibition of expression 
detected in constructs containing these tails (not 
shown). The XbaX-Pstl fragment was transferred 

to pDH51 (ref. 20) containing the CMV 35S promoter and terminator and 
the whole chimaeric gene was transferred to pBIN19 as an EcoRl insert. 
All recombinant DNA manipulations and Nicotiana genetic transformations 
followed standard procedures 21 - 22 . Total protein was prepared from 1 mg 
of 8-day transformant seedling tissue, electrophoresed on a 10% SDS- 
polyacrylamide gel and transferred to a nitrocellulose filter by electroblotting. 
After overnight incubation in a 1:1,000 dilution of mouse primary antibody 
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to aequorin isolated from Aequorea, the filter was incubated in a 1:5,000 
dilution of alkaline phosphatase conjugated anti-mouse IgG for 2 h. Colour 
was developed by incubation in lOOu-gml -1 nitroblue tetrazoitum, 
60 jig ml -1 BCIP (5-bromo-4-chloro-3-indolyl phosphate), 4mM MgCI 2 . 
100 mM Tris. pH 9.5. Protein electrophoresis and western blot analysis were 
according to standard procedures 23 . 



The cellular locale of the apoaequorin was estimated by cen- 
trifuging homogenates of transgenic seedlings at 116,000g and 
estimating distribution with coelenterazine by reconstitution. 
Over 97% of apoaequorin was contained in the soluble 
fraction. 

To investigate the effect of touch, seedlings were placed in a 
luminometer and the cotyledons touched with a fine wire or 
plastic rod at intervals of 1 min (Fig. 3a). With each touch an 
intracellular calcium concentration ([Ca 2+ ]j) spike was 
observed. When repeatedly touched on the cotyledons at about 
1-s intervals, a maximal response was elicited with about four 
touches (data not shown). Slight movement of the cotyledons 
on touching seems essential to this reponse. Touch responses 
are often thought to be limited to tendrils and sensitive plants 
such as Mimosa pudica and the Venus fly trap {Dionaea mus- 
cipula). Recent data 11 , however, show that a variety of touch 
stimuli substantially reduces growth of Arabidopsis and other 
plants. In this case the involvement of [Ca 2+ ]j in transducing 
touch signals was surmised from increased calmodulin gene 
expression. The data in Fig. 3 a support this hypothesis much 
more directly. 

The effect of imposing temperature fluctuations on seedlings 
from an ambient 20 °C to temperatures from 0 °C to 50 °C is 
shown in Fig. 3 b. Dramatic increases in [Ca 2+ ]i accompany the 
transition to 0 °C or 5 °C. Such plants can be rewarmed and on 
recooling exhibit a further [Ca 2+ ]j spike. The spike is higher if 
10 mM Ca 2+ is included in the chilling medium but has no effect 
at higher temperatures. Only irrigation with Ca 2+ at 100 mM 
has any effect at 20 6 C, causing repetitive spiking. No large 
increases due to heat-shock temperatures (40°C+) were 
observed. Many agronomically important plants are chill-sensi- 
tive and it has been suggested that chilling injury results from 
a failure in [Ca 2+ L homeostasis 12 . This method of [Ca 2+ ]j 
measurement will permit a proper test of this hypothesis in 
chill-sensitive plants. Temperate plants use fluctuating tem- 
perature regimes as signals modifying dormancy, reproduction 
and inducing temperature adaptation. A role for [Ca 2+ ]; in 
transducing these signals is now clearly indicated. 

A variety of fungal elicitors which initiate host-plant defence 
responses of hydrolytic enzyme production 13 , phytoalexin pro- 
duction and lignification 14 have been reported in many species 
including tobacco 15 . Elicitor preparations from yeast and Glio- 
cladium deliquescens have been shown to stimulate phytoalexin 
production in plant cells 16 ' 17 . These elicitor preparations also 
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RG. 2. Reconstitution of aequorin from apoaequorin in whole tobacco seed- 
lings, tobacco homogenates and from purified Aequorea apoaequorin. a, 
Reconstitution time course of aequorin in whole Nicotiana seedlings (■). b, 
Reconstitution time course in vitro of apoaequorin isolated from Aequorea 
(A) and Nicotiana (□). Reconstitution was estimated by measuring the 
amount of calcium-dependent luminescence produced from the reconstituted 
aequorin at each time point. 

METHODS. Apoaequorin was produced from Aequorea aequorin as previously 
described 24 . Reconstitutions of apoaequorin in Nicotiana homogenates and 
from Aequorea were in O.SMNaCI. SrriM mercaptoethanol, 5mM EDTA, 
0.1% gelatin (w/v). 10 mM Tris-HCI. pH 7.4, 2.5 jiM coelenterazine (gift from 
F. McCapra) at room temperature in darkness. Aequorin was reconstituted 
in whole Nicotiana seedlings by floating 8-day-old seedlings in water contain- 
ing coelenterazine at 1 uJvl in darkness. Before luminescence measurements, 
seedlings containing reconstituted aequorin were homogenized in 200 mM 
Tris, 0.5 mM EDTA. pH7.0, and in vitro assays were diluted 1:50 in the 
same buffer. Reconstituted aequorin was then discharged by the addition 
of an equal volume of 50 mM CaCI 2 and the total amount of luminescence 
produced over 10 s measured. Reconstitution in whole Nicotiana seedlings 
was dependent on coelenterazine concentration up to 10 jiM. Reconstituted 
aequorin in tobacco was relatively stable, with more than 80% remaining 
after 48 h. To test the ability of reconstituted aequorin to report [Ca 2+ ] J( 
seedlings were treated with mild detergent and extracellular calcium 
(lCa 2+ ] e ) to different concentrations. This treatment produced increases in 
luminescence with increasing [Ca 2+ ] e (data not shown). 
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FIG. 3. The effect of touch-simulus (a) temperature-shock (b) and fungal 
elicitors (c) on calcium-dependent aequorin luminescence. 
METHODS. Transgenic tobacco seedlings (8 days old) were grown on half- 
strength M and S medium 25 , 0.8% (w/v) agar. These were then incubated 
in water containing coelenterazine at 2 yJM (a. b) or 1 yJVl (c) for 6h in 
darkness. For luminescence measurements these seedlings were transfer- 
red singly to plastic cuvettes and placed in a luminometer. a. Seedlings 
were touched once every minute (at arrows) for 4 successive minutes with 
a fine wire, after which water at 0 °C was added, b. Seedlings were rapidly 
transferred from 20 °C to temperatures in the range 0-50 °C by the addition 
* of water at the appropriate temperature. Only the treatments at 0 °C, 5 °C, 
10 °C, 20 °C and 40 °C are shown, c The following elicitor preparations were 
added: (1) untreated yeast elicitor 16 ; (2) trifluoroacetic acid (TFA)-hydrolysed 
yeast elicitor; (3) void volume from a Sephadex G-25 column of yeast elicitor 
preparation; (4) proteinase K-digested yeast elicitor; and (5) untreated 
Gliocladium deiiquescens elicitor 17 (G. deliquescens spores a gift from E. 
Groskurt). TFA hydrolysis was carried out at 120 °C for 1 h with TFA at 2 M. 
Proteinase K digestion was at 37 °C for 1 h at an enzyme concentration of 
lmgml" 1 . Luminescence was recorded by a digital luminometer and the 
output sent to a chart-recorder. All plants tested responded to the stimuli 
described. Increases in [Ca 2+ ]j showed low variability between plants in 
response to reproducible stimuli, for example, cold shock (o), but higher 
variability in response to unquantifiable stimuli such as touch (a). After 
measurements seedlings were removed from the luminometer and transfer- 
red back to agar, where both vigorous growth and tropic sensitivity were 
observed. Two other independent tobacco transformants (MAQ2.3 and 
MAQ2.13; Fig. lb) showed the same responses (data not shown). No increase 
in luminescence was detected from untransformed seedlings treated with 
coelenterazine, nor from untreated transformed seedlings in response to 
the above stimuli (data not shown). 
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increase [Ca 2+ ]i (Fig. 3c). The active fraction in yeast is found 
in the void volume of a Sephadex-G25 column (Fig. 3c) but not 
G50 (data not shown); it shows some lability to trifluoroacetic 
acid under conditions that hydrolyse glycosidic linkages and is 
completely destroyed by proteinase K (Fig. 3c). Therefore, it is 
most probably a peptide of relative molecular mass ( M r ) 5,000- 
30,000 (5-30K) with attached carbohydrate. Very few fungal 
elicitors have been characterized although some are carbohy- 
drate and some protein 14 . Characterization of elicitors has so 
far required assays of enzymes induced in plant cells often over 
time courses of several days. This method of [Ca 2+ ]i measure- 
ment may greatly simplify elicitor purification and characteriz- 
ation owing to the rapidity of the assay. It has been speculated 
that elicitor signals are transduced through [Ca 2+ ]j (ref. 18). 
Again our data directly support this hypothesis. 

The luminescence observed in these experiments (Fig. 3) can 
only be a qualitative measure of [Ca 2+ ]i variation. The numbers 
and types of cells responding cannot be determined and there 
will be physical barriers (for example, cell walls) to detection 
of luminescent light from cell layers deeper than two or three 
cells from the surface. Most probably epidermal cells are the 



main contributor. The amount of aequorin discharged in any 
of these treatments is, however, below 5% of the total determined 
by homogenization and discharge. It cannot at present be deter- 
mined whether this discharged aequorin can be regenerated. 
But the continuous de novo synthesis of apoaequorin should 
allow reconstitution of aequorin several times in the same cell. 

We are now analysing the source of the [Ca 2+ ]i increases 
described in Fig. 3 by preparing constructs that will target 
aequorin to plant organelles. The kinetics of the [Ca 2+ ]j 
transients induced by touch, cold shock and elicitors are all 
different. They are very short for touch, intermediate for cold- 
shock and long for elicitors. Different cellular sources for [Ca 2+ ]i 
may explain these variations. Without this considerable simplify- 
ing advance in technology these measurements would be very 
difficult to make. The availability of a range of coelenterazines 
with different calcium sensitivities 6 can be expected to greatly 
improve measurements of different organelle calcium and pro- 
vide additional flexibility to this technique. The goal of monitor- 
ing crop plant [Ca 2+ ]j in the field, which could be used diaghosti- 
cally, is now brought much nearer with many consequent 
agricultural benefits. □ 
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The spatial pattern of expression for a maize gene encoding a hydroxyproline-rich glycoprotein (HRGP) was 
determined by in situ hybridization. During normal development of roots and leaves, the expression of the gene 
was transient and particularly high in regions initiating vascular elements and associated sclerenchyma. Its 
expression was also associated with the differentiation of vascular elements in a variety of other tissues. The gene 
encoded an HRGP that had been extracted from the cell walls of maize suspension culture cells and several other 
embryonic and post-embryonic tissues- The gene was present in one or two copies in different varieties of maize 
and in the related monocots teosinte and sorghum. A single gene was cloned from maize using a previously 
characterized HRGP cDNA clone [Stiefel et al. (1988). Plant Mol. Biol. 11, 483-493]. In addition to the coding 
sequences for the HRGP and an N-terminal signal sequence, the gene contained a single intron in the nontranslated 
3' end. 



INTRODUCTION 



The specialization of cell wall architecture is an important 
feature of the functional differentiation of plant cells (Varner 
and Lin, 1989). For example, the rigidity of sclerenchyma 
cells, the pressure-resistant nature of xylem cells, and the 
gas impermeability of photosynthetic bundle sheath cells 
ail rely on the properties of specialized walls that each of 
these cells deposits during differentiation. The wall of each 
distinct cell type appears to have a characteristic combi- 
nation and spatial organization of polysaccharides, struc- 
tural proteins, and other wall components, often with 
unique patterns of cross-links and other modifications (Fry, 
1986; Cassab and Varner, 1988). These structural spe- 
cializations are accomplished in part through the differen- 
tial expression of genes encoding wall structural proteins 
and synthetic and modification enzymes. 

The cell-specific expression of several genes encoding 
specialized wall components has been described in dicots. 
A gene encoding phenylalanine ammonia-lyase, the en- 
zyme catalyzing the first step in the synthesis of lignin 
monomers, is expressed at developing vascular centers, 
coincident with the differentiation of lignified xylem ele- 
ments (Bevan et al., 1989; Liang et al., 1989). Similarly, a 
gene encoding a member of the wall glycine-rich protein 

1 To whom correspondence should be addressed. 



(GRP) class that is accumulated in lignified secondary walls 
of xylem elements is expressed specifically in differentiat- 
ing protoxylem cells (Keller et al., 1989a). The genes for 
several members of another class of wall-associated 
proteins, the hydroxyproline-rich glycoproteins (HRGPs), 
have been shown to have distinct cellular patterns of 
expression. 

The most studied of the HRGPs are the dicot extensins 
(Cassab and Varner, 1988; Showalter and Varner, 1989). 
Extensins contain a characteristic repeat of the pentapep- 
tide Ser-Pro 4 , in which proline residues are hydroxylated 
and glycosylated. Extensins become insoluble with time 
and contribute to the mechanical strength of the wall, 
probably via cross-linking to other monomers and to other 
wall components (Fry, 1986). Extensins have been local- 
ized in dicots to the walls of several cells in which strength 
is a key property, including seed coat sclerenchyma and 
cotyledon vascular elements (Cassab and Varner, 1987). 
Individual extensin genes are expressed in tomato and 
tobacco with temporal and spatial patterns that suggest a 
high degree of developmental control (Showalter et al., 
1985; Memelink et al., 1987). Extensin accumulation can 
also be induced by fungal infection and by wounding, 
although distinct extensin genes may be induced in each 
case (Showalter et al., 1985; Corbinet al., 1987). Recently, 
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a distinct tobacco HRGP was described that is specifically 
synthesized and accumulated at sites of lateral root initia- 
tion (Keller and Lamb, 1989). 

Here we describe the expression pattern of a gene 
encoding a maize HRGP. We reported previously the 
isolation of a cDNA encoding a protein consisting of 13 
repeats of a proline- and threonine-rich peptide (Stiefel et 
al., 1988). This protein contains only a single copy of the 
Ser-Pro 4 motif repeated in extensins. The deduced protein 
probably corresponds to the hydroxyproline- and threo- 
nine-rich glycoproteins extracted from maize pericarp 
(Hood et al., 1988) and cell walls of suspension culture 
cells and various seedling and embryo tissues (Kieli- 
szewski and Lamport, 1987; Kieliszewski et al., 1990). 
The correspondence of these protein preparations to the 
protein predicted by cDNA sequencing was recently con- 
firmed by peptide sequencing (Kieliszewski et al., 1990). 
The gene encoding this protein is present in one or two 
copies in the genomes of maize, teosinte, and sorghum. 
The maize transcript is interrupted by a single intron in the 
3' untranslated region. We show that the gene is ex- 
pressed at sites of early vascular differentiation in em- 
bryos, coleoptiles, leaves, hypocotyl, and both primary and 
lateral roots, as well as at much lower levels throughout 
the developing plant. 



RESULTS 



A H E B S K B S T PT A B 




Figure 1. Genomic DNA Blot Analysis of Sequences Homologous 
to the HRGP cDNA in Maize and Related Species. 

(A) DNA (10 ^g) from the maize W64A variety was digested with 
Hindlll (H), EcoRI (E), BamHI (B), Sad (S), and Kpnl (K). 

(B) DNA (10 fig) from different sources was digested with Sack 
Samples are from sorghum (S), teosinte (T), and the maize vari- 
eties Patomero Totuqueno (PT), A188 (A), and Black Mexican 
Sweet (B). In both cases HRGP cDNA (probe MC56 from Stiefel 
et al., 1988) was used as a probe. Position of size markers (in kb) 
is shown. 



Maize HRGP Is Encoded by a Gene with a 3' Intron 

We previously described the isolation of a cDNA encoding 
a maize HRGP and showed that the corresponding mRNA 
is enriched in tissues with mitotic activity (Stiefel et al., 
1988) and in wounded tissues (Ludevid et al., 1990). The 
extensins, a group of HRGP proteins that have been found 
in differentiating and wounded tissues of several dicot 
species (Showalter and Varner, 1989), are encoded by 
small gene families. We performed genomic blot analysis 
to determine the number of sequences in the maize ge- 
nome and in those of the monocots teosinte and sorghum. 
Genomic DNA of maize, teosinte, and sorghum was di- 
gested with a variety of restriction enzymes, blotted to 
nylon membrane, and probed with the maize HRGP cDNA 
(pMC56), all as described in Methods. Figures 1A and 1B 
show that the HRGP gene is present in one or two copies 
in the genomes of four different maize inbred lines. Using 
the same cDNA probe, the gene has been mapped to a 
single locus on maize chromosome 2 by RFLP analysis 
(locus UMC 145, C. Guitton and D. Hoisington, personal, 
communication). Consistent with this, independent cDNAs 
from plumule (leaf) and root RNA were found to have 
identical sequences in noncoding regions, suggesting that 
these different tissues contain transcripts from a single 
gene (V. Stiefel, unpublished results). Figure 1B shows 



that the teosinte and sorghum genomes contain similar 
sequences, also in relatively low copy number. 

We cloned a genomic copy of the sequences repre- 
sented in the HRGP cDNA, as described in Methods. 
Several overlapping clones were isolated from a maize 
inbred AC1503 library in XEMBL3, and one was chosen 
for sequence analysis. The DNA sequence of 1 .8 kb in- 
cluding the HRGP gene is shown in Figure 2. A TATA box 
and polyadenylation signal are found at expected locations 
in 5' and 3' regions flanking the HRGP coding region at 
-112 bp and +1492 bp relative to the initial ATG. Up- 
stream of the TATA box, several short repetitive se- 
quences and polypyrimidine-polypurine stretches are 
found. 

Comparison of the HRGP genomic sequence with the 
previously determined cDNA sequence (Stiefel et al., 1988) 
revealed some minor sequence differences in the 3' un- 
translated region, probably due to the difference in maize 
inbred varieties used for cDN A and genomic cloning (W64A 
versus AC1503). Comparison of sequences in the coding 
region indicated that the characterized maize cDNA rep- 
resents an mRNA from this gene. In addition, the genomic 
HRGP clone includes a 166-bp sequence (underlined in 
Figure 2) delimited by consensus sequences for intron 
splice junctions. An extensin gene from carrot has this 
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Figure 2. Nucleotide Sequence of the Maize HRGP Gene and 
the Corresponding Protein Sequence. 

Nucleotides are numbered from the A of the start codon for 
JSnSoM+^J Putative TATA box and polyadeny.at.on s.gnafc 
« toZ The intron in the 3' untranslated end is undent! 
TheSSced artno acid sequence for HRGP is depicted as s,ngle 
tettefSSf ThTsite of processing of the mature prote,n between 
Ala 26 and Asp 27 is indicated by a vertical arrow. 



same unusual feature of an intron in the 3^ untranslated 
region of the transcript (Chen and Varner, 1985). 

The protein predicted by the genomic sequence was 
identical to that predicted by the cDNA, with the add.fonal 



N-terminal amino acids that were rn.ss.ng .n the (incom- 
plete) cDNA. Figure 3 shows that the predicted proton 
begins with a hydrophobic stretch having features typical 
ofa signal peptide. As confirmed by N-term.nal am.no and 
sequencing (see below), the mature protein sequence be- 
gins with the aspartic acid residue in posifon 27 followed 
By a short stretch (8 residues) of glycine and tyrosine 
residues. This is followed by a proline-rich region that 
includes the repeated peptide G^^y^^S't 
Lys-Pro-Pro-Lys-Glu-His. Finally, the prohne- threw ine- 
rich region that corresponds to the sequenced cDNA be- 
gins. The repeated hydrophilic/hydrophob.c character of 
the unmodified mature protein is very apparent .n th.s 

■Cffi&h. N-termina. amino acid of the = 
protein by N-terminal amino acid sequencing of the prate n 
encoded by the maize HRGP gene. A cell wan prote.n w.th 
Se Jtics of the protein predicted by the above se- 
quence has been isolated from maize pencarp (Hood et 
al 1 988) suspension culture cells (Kiel.szewsk. and Lam- 

^Kieliszewski et a., 1990). The amino .acic ^J-J^ 
these proteins matches that predicted from the HRGP 
DNA sequence (Table 1), and Kiel.szewsk, et aL (1990) 
confirm this with partial amino acid sequencing. We ^ used 
a similar purification scheme (see Methods) to isolate the 
Corresponding protein fraction from maize coleopt.les This 
orotein has The amino acid composition (Table 1) and 
plated chymotryptic cleavage pattern (data no own) 
expected for the encoded HRGP. As reported by K.el, 
szewski et al. (1990), the extracted protem migrates as a 
Se band at approximately 70 kD on SDS polyacryl- 
amTde gete. presumably due to its 
and high proline content. We determ.ned the N-term.nal 




acid number. 
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Table 1. Amino Acid Composition (in mol %) of Maize HRGPs 
from Different Sources 



E a P" PC-1 C THRGF* 



HP* 




24.7 


21.9 


24.8 


P 


45.4 


11.7 


■IOC 




T 


23.8 


24.0 


17.5 


25.3 


K 


12.2 


15.0 


11.3 


13.5 


Y 


6.6 


4.6 


4.6 


3.9 


S 


4.0 


4.7 


5.5 


7.3 


G 


2.3 


5.1 


7.1 


2.4 


A 


1.7 


5.7 


5.2 


1.7 


E 


1.3 


2.3 


2.5 


2.3 


H 


1.7 


1.8 


3.6 


2.4 


D 


0.7 


2.4 




0.7 


V 


0.3 


0.9 


2.7 


0.7 


Others 




1.0 


4.6 


0.5 



a Expected amino acid composition deduced from the nucleotide 

sequence of the HRGP gene. Amino acids corresponding to signal 

peptide (residues 1 to 26) were not computed. 

b HRGP protein purified from maize coleoptiles as described in 

Methods. 

c HRGP protein purified from maize pericarps (Hood et al, 1988). 
d HRGP protein from Black Mexican Sweet suspension cultures 
(Kieliszewski and Lamport, 1987). 

e HP, hydroxyproline. 



sequence of the extracted protein by Edman degradation 
with a gas phase sequencer. Sequencing cycles 4, 6, and 
10 gave tyrosine residues followed by the sequence Thr- 
Pro-Thr-Pro-X-Pro-Ala. This sequence appears at amino 
acids 37 to 43 of the genomic sequence open reading 
frame. Residues at positions 1 to 3, 5, and 7 to 9 (corre- 
sponding to glycine) relative to the N terminus were not 
clearly assigned due to a high background. From the amino 
acid sequence predicted by the genomic clone, this indi- 
cates that amino acid 1 is aspartate. The predicted hydro- 
phobic sequence prior to this aspartate corresponds to a 
signal peptide that is probably processed at the Ala-Asp 
junction between positions 26 and 27 of the predicted 
protein. This would make the mol wt of the processed 
protein (without modifications) 31 ,729 D. 

HRGP Transcripts Are Localized at Sites of Vascular 
Differentiation 

Our previous work suggests that mRNA for the maize 
HRGP and extractable HRGP are enriched in a variety of 
tissues coincident with mitotic activity during normal de- 
velopment or after wounding (Stiefel et al., 1988; Ludevid 
et al., 1990). HRGP mRNA abundance decreases dramat- 
ically in mature tissue. To localize more precisely the 
accumulation of HRGP mRNA, we used cDNA pMC56 as 
a probe for in situ hybridizations with developing and 
mature tissues of the shoot and root systems. Figures 4 
to 7 show that mRNA was localized to regions of vascular 



differentiation in hypocotyl and mesocotyl regions of • 
embryo axis (1 day post-germination), the first node >j 
developing shoot, leaf primordia in the plumule (6 & c , 
post-germination), and sheaths of more mature developing 
leaves. The resolution of the hybridization technique does 
not permit the assignment of signals over specific vascular 
elements, although signals are strongest over xylem 
elements and surrounding sclerenchyma. In each case, 
the most abundant signal appeared at a morphological 
stage in which xylem tracheary element differentiation is in 
progress. 

HRGP mRNA accumulated transiently in region 
undergoing vascularization in developing primary and lat- 
eral roots. Figure 8 shows an oblique section through the 
tip region of a primary root. Hybridization was strongest in 
regions of differentiating metaxylem and protoxylem ele- 
ments. The region of hybridization was always localized in 
a differentiating zone near the tip and was absent in more 
distant (more mature) regions. Figure 9 shows the highly 
localized accumulation of mRNA at sites of lateral root 
initiation, just behind the advancing lateral tip. Hybridiza- 
tion appeared to be most intense in regions of vascular 
differentiation arid formed a continuous connection from 
the primary root vascular cylinder to the new lateral root 




Figure 4. Localization of HRGP Transcripts in the Maize Embryo 
Axis. 

Sections of 1 day post-germination embryos were hybridized with 
the HRGP probe as described in Methods. 



Maize HRGP Expression in Vascularization 789 




Figure 5. Localization of HRGP Transcripts in the Region of the Coleoptilar Node. 

Longitudinal sections of 7 day post-germination plants were hybridized with the HRGP probe. 

(A) Coleoptilar node region, c, coleoptile; 1 , first leaf; 2, second leaf. 

(B) Enlargement of same section, hybridized with antisense strand (signal) probe. Arrows indicate signal over coleoptilar vein and central 
vein in axis. 

(C) Adjacent section hybridized with sense strand (control) probe. Arrows indicate absence of signal at same locations. 



tip. This cross-section also shows that no significant hy- 
bridization occurred over the now mature meta- and proto- 
xylem elements of the primary root. 



acidic and basic amino acids, a feature also observed in 
putative cell wall proteins from dicots (Franssen et al., 
1987; Hong et al., 1987). The remaining 248 amino acids 



DISCUSSION 

We have determined the DNA sequence and expression 
pattern for a maize gene encoding a highly repetitive 
proline- and threonine-rich protein. The deduced amino 
acid sequence suggests that it corresponds to an HRGP 
that can be extracted from cell walls of several maize 
tissues (Kieliszewski and Lamport, 1987; Hood et al., 
1988; Kieliszewski et al., 1990). 

The predicted protein primary structure included several 
distinct blocks that may have distinct wall structural func- 
tions. A hydrophobic signal sequence was located at the 
N terminus and was absent in the mature protein, presum- 
ably due to processing that accompanies its passage 
through the endoplasmic reticulum to the wall (Gardiner 
and Chrispeels, 1975; Von Heijne, 1981). The mature 
protein began with an eight amino acid hydrophobic gly- 
cine-tyrosine-rich region. At least one dicot glycine-rich 
protein is highly specific for tracheary element walls (Keller 
et al., 1989b). The glycine-rich region was followed by a 
short stretch of proline-rich sequence with alternating 




Figure 6. Localization of HRGP Transcripts in Plumules. 

Cross-sections through plumules of 6 day post-germination plants 
were hybridized with the HRGP probe. 

(A) Plumule cross-section, col, coleoptile; 1,2,3, leaves 1 , 2, and 
3. Signals are present at coleoptilar veins and at major veins of 
leaf 1. 

(B) Enlargement of region including coleoptilar vein and adjacent 
leaf 2 vein. Signals over both veins indicated by arrows. 




1 



.4 





Figure 7. Localization of HRGP Transcripts in Sheaths of Devel- 
oping Leaves. 

Cross-sections through 14 day post-germination shoots at the 
level of the developing sheaths of leaves 1 and 2 were hybridized 
with the HRGP probe. 

(A) Sheath regions of leaves 1 (1) and 2 (2). Signals are present 
only over veins of leaf 2. 

(B) Enlargement of vein from leaf 2. Signal is apparent over xylem 
and adjacent sclerenchyma (arrows). 



formed blocks of repetitive sequences in which tyrosine 
residues were regularly located in the most hydrophobic 
part of the molecule (Stiefel et al., 1988). Tyrosine and 
threonine residues were predominantly located in PPty 
peptides, a motif that is repeated in a tobacco HRGP 
associated with lateral root initiation (Keller and Lamb 
1959). Tyrosine residues have been proposed as sites of 
extensin polymerization (Fry, 1986). The hydrophobic en 
vironment of regularly spaced tyrosines in this maize 
HRGP might permit a polymerized hydrophobic surface on 
one face with a hydrophilic opposite surface, although this 
pattern would be altered by modifications such as qlyco 
sylahons. proline hydroxylations, and tyrosine polymeri- 
zation. Near the C terminus is a single Ser-Pro, sequence 

Varner! S 19 a 89T° tif eXt<5nSinS < Showa,ter and 

The localization of the maize HRGP mRNA and the 
corresponding protein suggests that it plays a role in the 
early construction of walls surrounding a vascular element 
or elements common to many organs. We showed that 
mRNA accumulates transiently at new vascular sites in 
embryos, leaves, and roots. This pattern is similar to that 
observed in bean for a glycine-rich protein (GRP), which 
appears to be a component of xylem walls (Keller et al 
1988). This maize HRGP mRNA is specifically induced at 
wound sites in young tissues (Ludevid et al., 1990) This 
pattern may represent the regeneration of vascular ele- 
ments to circumvent wounded sites, although the rapid 
time course (mRNA peak at 1 hr to 2 hr) makes it less 
likely. 

The highly localized accumulation of the mRNA at sites 




Figure 8. Localization of HRGP Transcripts in the Primary Root. 
r2 q H? eCtl0nS thr ° U9h tip re9ion of 10 dav Post-germination 

hybridized with the HRGP A "°w" 

indicate hybnd.zat,on signal over differentiating meta- (mx) and 
protoxylem (px) elements, c, cortex. 



Maize HRGP Expression in Vascularization 




Figure 9. Localization of HRGP Transcripts in Developing Lateral 

Roots. 

Cross-sections through 10 day post-germination root systems 
were hybridized with the HRGP probe. This section through the 
pnmary root is at the level of a newly initiated lateral root c 
cortex; e, endodermis; pc, pericycle; px, protoxylem; mx, meta- 
xylem. No signal is present in primary root. Arrow indicates 
considerable signal over developing vascular bundle of lateral 
root. 



of lateral root initiation might also be interpreted as being 
at sites of wounding, as the initiated lateral root penetrates 
existing tissue. However, our in situ hybridization patterns 
showed that the HRGP mRNA is associated with the 
developing vascular link of the new lateral root to the 
pnmary root, rather than at the "wounding" tip. This pattern 
is in contrast to the accumulation pattern of a distinct 
HRGP gene in tobacco, which is expressed only at the 
tips of new lateral roots and which may have a mechanical 



role in penetration of new roots through existing tissue 
(Keller and Lamb, 1989). However, we also observed 
accumulation of the maize HRGP in regions associated 
with xylem differentiation in the primary root, where no 
wound-like process occurs. The distinct patterns of gene 
expression and protein accumulation for various dicot and 
monocot HRGPs suggest that they constitute a family of 
proteins with diverse roles in plant walls. 



METHODS 



Genomic Blot Analysis 

Genomic DNA was prepared from leaves of maize inbreds W64A 
A188, Palomero Toluqueno, and Black Mexican Sweet varieties 
and from teosinte (Zea diploperennis) and sorghum (Sorghum 
btcolor), as described by Burr and Burr (1981). The DNA was 
digested with restriction enzymes, separated by size in 0 8% 
agarose gels, and blotted onto nylon membranes as recom- 
mended by the manufacturer (Zeta-Probe, Bio-Rad). Hybridization 
was carried out in 180 mM NaCI, 10 mM sodium phosphate 10 
mM EDTA, 1% SDS, 0.5% nonfat milk, 0.5 mg/mL sonicated 
salmon sperm DNA at 68°C. The DNA probe was labeled with 
P to a specific activity of 0.5 to 2 x 10 9 cpm/ M g by random 
pnmmg (Boehringer Mannheim). Final washes were done in 0 1 x 
SSC, 0.1% SDS at 65°C, and the membrane was exposed to 
Kodak XAR5 film with intensifying screens (DuPont Lightning Plus) 



Genomic Cloning 

A maize inbred AC1503 genomic library (kindly provided by A 
Gierl, Koln; Sau3A partial library in AEMBL3) was screened with 
the insert of the pMC56 HRGP cDNA clone (Stiefel et al 1988) 
by standard methods (Sambrook et al., 1989). A 5-kb genomic 
clone hybndizing with the probe was sequenced using the dideoxy 
method after subcloning in M1 3mp1 8 and 1 9 (Sanger et al. , 1 977); 
1 00% of both strands were sequenced over the 5-kb region. 

Protein Analysis 

Protein for amino acid analysis and N-terminal sequencing was 
extracted (Mazeau et aJ., 1982) from the coleoptiles of 6-day-old 
maize plantlets (inbred W64A) and deglycosylated as previously 
described (Stiefel et al., 1988). Amino acid analyses of purified 
HRGP were carried out after acid hydrolysis in a Pico-Taq (Milli- 
pore) analyzer according to the manufacturer. N.terminal amino 
acid sequence was determined in a gas-phase sequencer (model 
470A, Applied Biosystems) according to the manufacturer. 

In Situ Hybridization 

For post-embryonic materia], histological techniques and in situ 
hybridization were performed on paraffin-embedded samples as 
previously described (Langdale et al., 1 988). Frozen sections were 
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used for embryos. Embryos germinated for 48 hr were embedded 
and frozen in OTC (Tissue Tek) compound on dry ice. Cryostat 
(Reichert-Jung) sections 8 ^m to 10 thick were collected on 
gelatin-subbed slides, dried on a hot plate at 60°C for 1 min, and 
fixed in a freshly made 4% paraformaldehyde solution in PBS 
(0.15 M NaCI, 10 mM sodium phosphate, pH 7.4). ^S-labeled 
riboprobes were synthesized with T3 and T7 RNA polymerases 
(Bethesda Research Laboratories), using linearized cDNA sub- 
clones derived .from pMC56 as template. 
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Abstract 

Reduction of turgor in pea shoots caused the accumulation of several poly(A) RNAs. cDNA clones 
derived from three different poly(A) RNAs which accumulate in wilted pea shoots were isolated, 
sequenced and expression of the corresponding genes examined. Clone 7a encoded a 289 amino acid 
protein. The C-terminal 180 amino acids of this protein were homologous to soybean nodulin-26. RNA 
hybridizing to cDNA 7a was abundant in roots, and induced in shoots by dehydration, heat shock and 
to a small extent by ABA. Hydropathic plots indicate that the protein encoded by cDNA 7a contains 
six potential membrane spanning domains similar to proteins which form ion channels. Clone 15a 
encoded a 363 amino acid protein with high homology to cysteine proteases. RNA hybridizing to cDNA 
15a was more abundant in roots than shoots of control plants. Dehydration of pea shoots induced cDNA 
15a mRNA levels whereas heat shock or ABA treatment did not. Clone 26g encoded a 508 amino acid 
protein with 30% residue identity to several aldehyde dehydrogenases. RNA hybridizing to cDNA 26g 
was induced by dehydration of shoots but not roots and heat shock and ABA did not modulate RNA 
levels. Levels of the three poly(A) RNAs increased 4-6-fold by 4 h after wilting and this increase was 
not altered by pretreatment of shoots with cycloheximide. When wilted shoots were rehydrated, RNA 
hybridizing to cDNA 26g declined to pre-stress levels within 2 h. Run-on transcription experiments using 
nuclei from pea shoots showed that transcription of the genes which encode the three poly(A) RNAs was 
induced within 30 min following reduction of shoot turgor. One of the genes showed a further increase 
in transcription by 4 h after dehydration whereas transcription of the other 2 genes declined. These results 
indicate that plant cells respond to changes in cell turgor by rapidly increasing transcription of several 
genes. Furthermore, the expression of the turgor-responsive genes varies with respect to the time course 
of induction and reversibility of the wilting-induced changes. 



12 



Introduction 

Plant growth and development depend on the 
acquisition of water from the soil. As a con- 
sequence, exposure of plants to water-limiting 
environments causes a diverse set of metabolic 
and developmental changes. The specific plant 
response to water deficit depends on the severity 
and duration of the water deficit, the stage of 
development and plant species examined. For 
example, mild water deficit often induces stomatal 
closure which reduces water loss from leaves. 
Stomatal closure also limits gas exchange and 
inhibits photosynthesis. Some plants respond to 
this situation by switching from C 3 to CAM 
photosynthesis which increases water use effi- 
ciency [rev. 59]. Shoot growth is also very sensi- 
tive to water deficit [3, 9, 10, 49]. Associated with 
inhibition of shoot growth are decreases in poly- 
somes and changes in poly(A) RNA populations 
[1, 2, 10, 41, 42]. Interestingly, shoot growth is 
more sensitive than root growth to water deficit 
[10, 64]. This is due in part to differential sensi- 
tivity of roots and shoots to abscisic acid (ABA) 
which increases 10- to 50-fold in water-deficient 
plants [ 10] . The mode of action of ABA is unclear 
but this growth regulator is known to alter 
K + /H + flux [5, 55]. Other plant growth regula- 
tors may modulate additional responses of plants 
to drought [rev. 67]. 

Cells of well watered plants maintain a positive 
cell turgor pressure of 0.4 to 0.8 MPa. Exposure 
of plants to low water potential soil or conditions 
where transpiration exceeds the capacity of roots 
to supply water to shoots often leads to loss of cell 
turgor. As in unicellular organisms [rev. 30], loss 
of plant cell turgor can lead to osmotic adjustment 
and the accumulation of compounds such as 
sucrose, proline and glycine betaine [22, 43, 65]. 
Sugar accumulation in hypocotyls of water- 
deficient soybean seedlings occurs primarily 
because growth decreases while sugar uptake 
continues [43]. Accumulation of proline and 
glycine betaine, however, involves increased rates 
of synthesis and changes in gene expression [26, 
65]. Loss of turgor in plants also induces the 
synthesis of ABA [ 12, 52], which is derived from 



a preexisting pool of xanthophylls such as 
violaxanthin [11, 19]. The induction of ABA 
synthesis is rapid (~30min), requires nuclear 
gene expression [23, 54] and could involve pres- 
sure-sensitive receptors [6] or ion channels [46]. 
Increased levels of ABA can in turn induce 
changes in gene expression [4, 7, 8, 10, 18, 20, 21, 
24, 27, 28, 31, 36, 40, 44, 47, 56, 57]- The expres- 
sion of ABA-responsive genes is modulated dur- 
ing seed development [7, 8, 18, 20, 27, 31, 36, 40, 
47, 57, 69], in response to plant dehydration [4, 
8, 10, 20, 24, 28, 47] or low temperature [cf. 44, 
56]. 

Recently, we reported that several pea shoot 
poly(A) RNAs increased in abundance when pea 
shoots were wilted [24]. cDNAs corresponding 
to several induced genes were isolated [24]. In 
this paper , we report the sequences of three 
cDNAs and show that the corresponding genes 
are rapidly induced at the transcriptional level 
when the turgor of pea shoots is reduced to near 
zero. The expression of these genes in leaves, 
stems and roots was examined and induction was 
found to be insensitive to cycloheximide. 



Materials and methods 

Plant growth and dehydration treatments 

Plant growth and shoot treatments were carried 
out as reported earlier [24]. Pisum sativum 
(Progress No. 9) was grown for 10-12 d at 23 °C 
in well watered vermiculite using a model El 5 
Conviron growth chamber. Plants were illumi- 
nated (70 /xE m " 2 s - 1 ) for 18 h followed by 6 h of 
darkness. Plant shoots (stems plus leaves) were 
cut and the stem base quickly submerged under 
water where a portion of the lower stem was recut. 
Each shoot was placed in a 0.5 ml microfuge tube 
containing 10 mM Hepes-KOH (pH 8.0) for a 1 h 
hydration pretreatment with illumination at 
23 °C. Shoots to be wilted were removed after 
pretreatment, weighed, then wilted in darkness 
under a stream of 23 °C air for 20min until a 
10-15% loss of fresh weight occurred. Wilted 
shoots were then incubated in a dark humid 



chamber for 0, 0.5, 1 or 4 h prior to further 
analysis. One set of shoots was rehydrated by 
reexcision of 4 h wilted plant stems under water 
then submerging the stem in pretreatment buffer 
for 2 h in the light at 23 °C. For heat shock treat- 
ment, shoots were excised and placed in pre- 
treatment solution for 1 h in the dark at 38 °C. 
Stems were kept submerged to help prevent shoot 
dehydration and any wilted shoots were dis- 
carded. Shoots used for examining the effects of 
exogenous ABA were excised as above and 
placed in solutions of 10 mM Hepes-KOH 
(pH8.0) which contained 18 jzM (±) abscisic 
acid for 1 h in the dark at 23 °C. Cycloheximide- 
treated shoots were pretreated for 1.5 h in the light 
at 23 °C in 10 mM Hepes-KOH (pH 8.0) which 
contained 20 /ig/ml cycloheximide followed by 
wilting and a 4h incubation period. For one 
experimental condition, plants were grown hydro- 
ponically in aerated half-strength Hoagland's 
solution in the growth chambers described above. 
Water deficit was imposed on some of these 
plants by making the hydroponic medium 0.6 M 
mannitol. 



RNA and DNA sequence analysis 

Total RNA was isolated and 1 analyzed by 
northern blotting as previously described [24] 
except that when utilized, LiCl precipitation of 
RNA was done without ethanol. Preparation of 
probes from selected cDNA clone inserts and the 
hybridization conditions have also been described 
[24]. Dideoxy sequencing reactions were per- 
formed on double-stranded plasmid DNA, using 
a 35 S-ATP (from NEN), custom-made oligo- 
nucleotide primers, and the Sequenase kit (U.S. 
Biochemical Corp.). DNA sequencing ladders 
were analyzed on 5% polyacrylamide 8.3 M urea 
gels and both DNA strands were sequenced to 
completion. DNA sequences were analyzed using 
the Intelligenetics computer programs (Intelli- 
genetics, Inc., Mountain View, CA). 
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Transcription assays of isolated nuclei 

Pea shoot nuclei were isolated using Percoll step 
gradients by a method similar to Hagen and 
Guilfoyle [25]. Approximately 10 g of pea shoots 
were rinsed with water followed by a 1 min soak 
in ice-cold diethyl ether and a quick rinse in 
nucleus isolation buffer (NIB; 10 mM Tris-HCl 
(pH7.5), 5mM MgCl 2 , 1M sucrose, 10 mM 
/?-mercaptoethanol). The remaining steps were 
done in a cold-room (4 °C) or on ice. All centrifu- 
gations were done at 4 °C. Shoots were quickly 
chopped with scissors, 75 ml NIB added and 
tissue homogenized using a Polytron (Brinkman 
PT-10 generator) for l min at medium high 
setting. The mixture was filtered through Mira- 
cloth (Calbiochem) and the filtrate centrifuged 
10 min at 2500 g. The pellet was resuspended with 
5 ml of NIB and further homogenized with 20 
strokes in a tight-fitting Dounce homogenizer. 
The solution was split into two portions, layered 
onto separate Percoll step gradients consisting of 
25, 37 and 50% Percoll in NIB, and centrifuged 
15 min at 7700 g. The nuclei at the 37% -50% 
Percoll interface were removed, mixed with 10 ml 
of nuclei suspension buffer (25% v/v glucerol, 
5 mM Mg acetate, 50 mM Tris-HCl (pH 8.0), 
5 mM dithiothreitol, 0.1 mM EDTA) and recen- 
trifuged for 10 min at 2500 g. The pellet of nuclei 
was resuspended in 2 ml of suspension buffer and 
recentrifuged. The resulting pellet was resus- 
pended to a final volume of 250 /il in suspension 
buffer and either used immediately for in vitro 
transcription reactions or frozen at - 80 °C. 

Transcription in isolated nuclei and isolation of 
RNA was as described by Love and Minton [37]. 
Each reaction containing 500 /xCi [a- 32 P]-UTP 
(800 Ci/mmol, New England Nuclear). Reaction 
times varied from 6-60 min depending on the 
experiment. Incorporation of radiolabel into 
RNA was determined by spotting aliquots onto 
Whatman 3MM paper and assaying precipitable 
counts by liquid scintillation counting as described 
in Maniatis etal [39]. 
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Chemicals 

Glycerol, agarose, guanidinium isothiocyanate, 
CsCl and dithiothreitol were purchased from 
Bethesda Research Laboratories. Percoll was 
obtained from Pharmacia. Nucleotide triphos- 
phates were from Pharmacia and ultra-pure urea 
from Schwartz-Mann. Unless noted otherwise, 
all enzymes were purchased from Bethesda 
Research Laboratories and all other chemicals 
from Sigma Chemical Co. 



Results 

Expression of dehydration-inducible genes in intact 
plants 

Earlier studies [24] identified several cDNAs 
which correspond to genes whose poly(A) RNA 
levels increase when excised pea shoots are wilted 
and incubated in darkness. The genes' expression 
could have been altered due to plant excision [59, 
62], loss of cell turgor, transfer of plants to dark- 
ness or to changes in plant growth regulator 
levels. To distinguish among the first three possi- 
bilities, pea plants were grown hydroponically in 
continuous light for 10 days. At this point, the 
medium was adjusted to 0.6 M mannitol. This 
treatment caused leaf wilting to occur. The plants 
were maintained for 4 h in the light in mannitol 
and then leaves were harvested for RNA extrac- 
tion and northern blot analysis. The northern 
blot in Fig. 1 shows that mannitol treatment 
caused RNA hybridizing to clones 14d (a control) 
and 45c to decrease in abundance. Previously, 
RNA hybridizing to clone 45c increased when 
wilted pea shoots were placed in darkness (clone 
F in ref. 24). Other experiments showed that this 
gene is induced by transfer of non-wilted plants 
from light to darkness (data not shown). In con- 
trast, RNA hybridizing to clones 7a, 15a and 26g 
(clones C, D and E in ref. 24) increased in abun- 
dance when roots of intact pea seedlings were 
treated with mannitol and were not affected by 
transfer from light to darkness. This result is con- 
sistent with previous data showing that these 



genes are induced when pea leaf cell turgor is 
reduced to near zero [24]. 

Expression of dehydration-inducible genes in roots , 
stems and leaves 

Expression of the genes corresponding to cDNAs 
7a, 15a, 26g and 14d was examined in excised 
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Fig. 1 . Northern analysis of RN As extracted from the shoots 
of control hydroponically grown pea plants (lane 1) and 
similar plants whose roots were exposed to 0.6 M mannitol 
(lane 2). Five /ig of total RNA from control and mannitol- 
treated plants was separated on an agarose gel, blotted and 
probed using nick-translated inserts from cDNA clones 14d, 
7a, 15a, 26g and 45c as described previously [24]. 
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hydrated leaves, stems and roots and after these 
plant parts had been dehydrated to reduce fresh 
weight by 10% (leaves, stems) or exposed to 
0.6 M mannitol (roots). In hydrated tissue, tran- 
scripts hybridizing to control gene 14d are most 
abundant in leaves and undetectable in roots 
(Fig. 2). The level of 14d transcripts declined in 
dehydrated leaves and stems. In contrast, 7a and 
15a transcripts levels were most abundant in 
hydrated roots with lower expression in leaves. 
RNA levels of these genes increased in leaves and 
stems when these plant parts were dehydrated. A 
small increase in 15a and 7a RNA levels occurred 
when roots were treated with 0.6 M mannitol. 
RNA hybridizing to the gene corresponding to 
cDNA 26g could be detected in control roots and 
stems and its level increased when leaves or stems 
were diehydrated. 

LEAF STEM ROOT 
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Fig. 2. Northern analysis of total RNAs from leaves (lanes 1 , 
2), stems (lanes 3, 4) and roots (lanes 5, 6). Leaves and stems 
were excised from hydrated plants and then incubated in 
humid boxes for 4 h (lanes 1, 3) or dehydrated to reduce fresh 
weight 10% and then incubated for 4 h (lanes 2, 4). Roots 
were excised and incubated in aerated water (lane 5) or 
0.6 M mannitol for 4 h (lane 6). RNA extraction and analysis 
was carried out as described in Fig. 1. 



Time course of changes in RNA levels induced by 
wilting and influence of heat shock, ABA , rehydration 
and cycloheximide 

Changes in the abundance of RNAs hybridizing 
to cDNA clones 14d, 7a, 15a and 26g was deter- 
mined as a function of time after plant de- 
hydration, and in heat-shocked and ABA-treated 
plants (Fig. 3). This experiment showed that 
RNA hybridizing to clones 7a and 15a increased 
within 0.5 h after plant dehydration and that 
RNA levels continued to increase for at least 4 h 
if the reduced turgor state was maintained. RNA 
hybridizing to clone 26g also increased but the 
time course of induction was slower. We previ- 
ously found that transfer of pea shoots to 38 °C 
for 1 h induced a large set of genes which 
presumably encode heat shock proteins [24]. 
This treatment therefore was used to test if high 
temperature would induce the level of RNA 
hybridizing to clones 7a, 15a or 26g. The results 
in Fig. 3 show that heat shock treatment 
increased the level of RNA hybridizing to clone 7a 
but not 15a or 26g. The ability of ABA to increase 
RNA levels of the dehydration-inducible genes 
was tested next. Excised shoots were fed 18 /zM 
ABA through the transpiration stream for 1 h. 
This treatment caused ABA levels to increase 
from 0.01 /ig/g fresh weight to 0.1/xg/g fresh 
weight [24]. ABA accumulates to this level in 
wilted pea shoots within 1 h and induces changes 
in poly(A) RNA populations [24]. Treatment for 
4 h did not increae ABA levels further, presuma- 
bly because stomatal closure blocked further 
uptake via the transpirational stream. In the pres- 
ent study, treatment of seedlings for 1 h with 
18 /xM ABA increased the level of RNA hybridiz- 
ing to clone 7a to a small extent. RNA levels of 
clones 14d, 15a and 26g were not altered by ABA 
treatment. 

The results in Fig. 3 were verified and extended 
by northern analysis of total RNA (Fig. 4). Total 
RNA was isolated from control shoots pretreated 
for 1 h in Hepes buffer and shoots which had been 
pretreated, wilted and incubated 0.5 and 4h 
(Fig. 4). In addition, RNA was isolated from 
shoots which had been pretreated, wilted and 
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Jfe. 5. Time course of changes in RNA levels induced by 
dehydration, heat shock or ABA. Poly(A) RNA was ex- 
tracted from the shoots of control plants (C), excised shoots 
hydrated for 1 h (H), shoots dehydrated to produce a 10% 



incubated for 4 h, and then rehydrated again by 
cutting the stem under water and submerging the 
stem in Hepes buffer for 2 h in the light at 23 °C 
(Fig. 4, lane 4). Cycloheximide (20 /zg/ml) was fed 
through the transpirational stream to another set 
of excised shoots for 1.5 h followed by wilting 
and a 4 h incubation period (Fig. 4, lane 5). This 
treatment blocks cytoplasmic translation [48]. 
The northern blot in Fig. 4 confirmed the induc- 
tion time course of RNAs hybridizing to clones 
7a, 15a and 26g. When shoots were rehydrated for 
2h, RNA hybridizing to clone 26g declined 
whereas RNA hybridizing to clones 7a or 15a did 
not decline. This could indicate that these latter 
RNAs are quite stable or that rehydration does 
not cause a rapid decrease in transcription of 
these genes. This figure also shows that cyclo- 
heximide, an inhibitor of protein synthesis on 80S 
ribosomes, did not prevent dehydration-induced 
increases in RNA. This result indicates that no 
new proteins need to be synthesized in order to 
induce the levels of RNA hybridizing to clones 7a, 
15a and 26g. 



Transcription in isolated nuclei 

Nuclei were isolated from control and 4 h wilted 
pea shoots and utilized for optimization studies of 
in vitro transcription reactions. To maximize the 
incorporation of radioactivity into RNA poly- 
merase II transcripts, the K + optimum for RNA 
polymerase II in our system was determined. 
In vitro transcription reactions were done for 
60min and a-amanitin (10/ig/ml) was used to 
determine the contribution of RNA polymerase II 
to total transcription. This concentration of 



weight loss (D), and dehydrated shoots incubated in a humid 
chamber for 0.5, 1, or 4 h. Other excised shoots were heat- 
shocked at 38 °C or treated with 18 //M ABA for 1 h prior 
to RNA extraction. Poly(A) RNA from each sample was used 
for cDNA probe preparation. Radiolabeled cDNA probes 
were hybridized to duplicate nylon filters containing 0.5 fig 
DNA of each gene of interest. After hybridization and wash- 
ing, DNA-containing dots were excised and hybridizing 
radioactivity determined by scintillation counting. This 
experiment was done in triplicate and standard deviations 
are indicated for each point. 
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//g. 4. Northern analysis of changes the abundance of 
RNAs hybridizing to control cDNAs (2c, 14d) and to 
dehydration-inducible cDN As (7a, 1 5a, 26g). Total RNA was 
extracted from hydrated pea shoots (lane 1), pea shoots 
dehydrated and incubated for 0.5 or 4 h (lanes 2, 3), shoots 
dehydrated and incubated for 4 h which were then re- 
hydrated by incubation in buffer for 2 h (lane 4), and shoots 
pretreated with cycloheximide prior to dehydration and 4 h 
incubation (lane 5). RNA blots were prepared and probed as 
described in Fig. 1. 



a-amanitin completely inhibits transcription by 
wheat germ RNA polymerase II [34]. These 
results are illustrated in Fig. 5 and show that 
RNA polymerase II transcription activity was 
maximal at 0.65 M K + , similar to that found with 
Drosophila nuclei [37]. 

Transcription reaction characteristics were 
examined using 6 min reactions, 0.65 M K + , and 
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Fig, 5. Potassium optimization of transcription assays in iso- 
lated pea shoot nuclei. The upper line (open squares) denotes 
reactions without a-amanitin, the lower line (closed squares) 
were reactions carried out in the presence of 10 /ig 
a-amanitin. The line marked by closed triangles denotes 
a-amanitin-sensitive incorporation of 32 P-UTP into RNA in 
isolated nuclei. 

0.65 /iCi/jul [ 32 P]-UTP (Table 1). Omission of 
unlabeled NTPs from the reaction solution sig- 
nificantly reduced the radiolabel incorporation 
into RNA. Including actinomycin D or a- 
amanitin at 10 /ig/ml reduced the incorporation 
in both control nuclei and in nuclei from wilted 
shoots (Table 1). In an earlier experiment using 

Table 1. Transcription in nuclei isolated from pea shoots 



Sample 


Percent of transcription 




by control nuclei 1 


Nuclei from control shoots 




Complete mix 


100 


- unlabeled NTPs 


29 


+ 10 fig/ml a-amanitin 


13 


+ 10 ^g/ml actinomycin D 


20 


Nuclei from 4 h wilted shoots 




Complete mix 


74 


- unlabeled NTPs 


29 


+ 10 /ig/ml a-amanitin 


16 


+ 10 ^g/ml actinomycin D 


33 



1 All reactions were carried out for 6 min. The complete 
transcription mix from control shoots contained 1.34 x 10 5 
nuclei. Incorporation over the 6 min assay was 
2.39 x 10 4 cpm (100% value). 
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6, Transcription of the genes corresponding to cDNAs 
14d, 7a, 15a, and 26g in isolated nuclei. Pea shoots were 



0.56/iCi//xl [ 32 P]-UTP, the omission of nuclei 
resulted in precipitable radioactivity of 6% of the 
complete mix control (data not shown). The 
transcription activity of nuclei isolated from 4 h 
wilted shoots is 74% that of control shoot nuclei, 
indicating that wilting caused an overall inhibition 
of transcription activity. 



Transcription of turgor-responsive genes 

Transcription of the genes corresponding to 
cDNAs 14d, 7a, 15a and 26g was assayed in 
nuclei isolated from hydrated and dehydrated pea 
shoots. Nuclei were isolated from control shoots 
and shoots wilted and incubated for 0.5 and 4 h. 
RNA synthesized by nuclei in vitro was used to 
probe a set of replica filters containing DNA from 
cDNAs 14d, 7a, 15a and 26g. The results from 
this experiment are shown in Fig. 6. Transcription 
from the gene corresponding to clone 14d was not 
induced by wilting. Transcription of RNA hybrid- 
izing to clones 7a and 15a showed approximately 
a 3-fold increase by 0.5 h after shoot dehydration. 
However, the activity returned to approximately 
the pre-stress level by 4 h. In contrast, RNA levels 
for these genes did not decline during this period. 
The gene corresponding to clone 26g showed 
approximately a 2-fold induction of transcription 
activity at the 0.5 h point. Transcription activity 
for this latter gene continued to increase to a level 
approximately 6-fold over the control level by 4 h. 
This increase in transcription was paralleled by 
increases in RNA level (Fig. 4). 

A second set of nuclei isolated from control 
shoots and shoots wilted and incubated for 0.5 
and 4 h were used to repeat the experiment 
reported above. For these nuclei, 30 min 
reactions were utilized for one set of control, 0.5 
and 4 h dehydrated shoots and 30 min reactions 

excised at time (C), hydrated for 1 h (H) and then dehydrated 
until a loss of 10% fresh weight had occurred (D). De- 
hydrated plants were incubated for up to 4 h in a humid 
chamber. Nuclei were isolated at the times indicated and 
allowed to incorporate [ 32 P]-UTP into RNA for 1 h. The 
radiolabeled RNAs were then extracted and hybridized to 
DNA fixed to nylon membranes. 
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followed by a 15min chase with 0.5 raM 
unlabeled UTP for a second set of control and 4 h 
treated shoots. These experiments gave qualita- 
tively similar results to those seen in Fig. 6, differ- 
ing primarily in the amount of radioactivity 
hybridizing to the filter-bound DNA but not in the 
induction trends. In addition, no reduction in 
hybridizable RNA was evident in pulse-chase 
experiments and the size radiolabeled RNAs did 
not change significantly during the chase (data 
not shown). 



DNA sequence analysis 

The size of the RNAs hybridizing to the cDNAs 
7a, 15a and 26g were 1.2, 1.5 and 1.8 kb repec- 



tively. The cDNA inserts in clones 7a, 15a and 
26g were close to the size expected for full-length 
copies of the mRNAs. To obtain further informa- 
tion concerning the products encoded by the 
cDNAs, DNA sequence analysis was carried out 
(Fig. 7A-C). Each cDNA sequence contained a 
single large open reading frame. The deduced 
amino acid sequences generated from the open 
reading frames are shown in Figs. 7A-C. Each 
deduced amino acid sequence was compared to 
sequences compiled in the Intelligenetics Protein 
Identification Resource. 

Sequence analysis of.cDNA 7a revealed an 
open reading frame encoding a putative 289 
amino acid protein (Fig. 7 A). The C-terminal 180 
amino acids of the deduced amino acid sequence 
had 33% overall residue identity with soybean 



432 



iTTTT 504 



AATTCCGCAAGAAACCAGAACAGTCTAGAAAGAAACATGGAAGCCAAGGM 12 

MEAKEQDVS LGA 

AACAAGTTTCCAGAGAGACAGCCACTCGGGATTGC^ 144 
NKFPERQP LGIAAQSQDEPKDYQE 

CCACCACCC^GCCACTTTTTGAGCCCTCGGAGCTGAC 216 
PPPAPLFEPSELTSWSFYRAGIAE 

TTCATAGCCACTTTCCTTTTCCTTTATATAACCG 288 
FIATFLFLY ITVLTVMGVVRES SK 

TGCAAAACGGTTGGTATTCMGGAATCGCTTGGGCT^ 360 
CKTVGIQG IAWAFGGMIFALVYCT 

GCTGGMTCTCAGGGGGTCATATAAACCCAGCTGTGAC^ 

AGI SGGHINPAVTFGLFLARK SL 

ACAAGAGCAATATTCTACATGGTGATGCAAGTGTTGGGTGCTATATGCGGTGCTGGrc 
TRAIFYMVMQVLGAICGAGVVKG 

GAAGGTAAACAAAGGTTTGGAGATCTGAACGGTGGT 576 
EGKQRFGD LNGGANFVAPGYTKGD 

GGACTTGGTGCTGAAATTGTTGGCACTTTCATTCTTGTATAC 648 
GLGAEIVGTFI LVYTVFSATDAKR 

AGCGCCAGAGACTCTCATGTTCCTATTTTX5GCACCGTTGCCAATTGGATTC 720 
SARDSHVP I LAPLPIGFAVFL.VHL 

GCAAC TATACCAATTACTGGAAC TGGTATTMCCCTGCCAGAAGTC TTGGTGC TTCAACAAG 792 

AT I P I TG TG I NPA RSLGAAIV F N K 

AAAATTGGTTGGAATGATCACTGGATTTTCTG&G 864 
KIGWNDHWIFWVGPFIGAALAALY 

CATCAAG TTG TTATCAGAGCCATTCCC TTCAAG TC TMGTGATTGAATTGAATCAAAC GGTTC TTGATCAAT 936 
HQVVIRAIPFKSK . 

CATATCATGTTTGTGGATTTCATTTCATO 1008 

TTAMTGTTTGTTTTGTMTTTGTATGTAAAGAAGTACTAm 1080 

GTTGCGTAATGGGTCCAGCTCTCGTGTGTGAATGTGTATTTTGCCAM 1152 

TTTCTGAAAAAAAAGG fig y A 1168 

Fig. 7. DNA sequences of cDNA 7a (Fig. 7A, this page), cDNA 15a (Fig. 7B) and cDNA 26g (Fig. 7C). Deduced amino acid 
sequences are shown below the corresponding DNA codons. Amino acid regions which have sequence similarities to other 

proteins are underlined and discussed in the text. 
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MTTCCGCTCTTCTCACCCAACGAACCTTCGCTCC^ 72 

MDRRFLFALFL 

TCTTCGCCGCCGTAGCAACCGCCGTMCCGACGACACCAACAACGACGACTTCATAATCCGTCA 144 
FAAVATAVTDDTNNDDF I IRQVVD 

ATAACGAAGAAGATCACTTGTTAAACGCAGAGCATCACTTCACTTC^ 216 
NEEDHLLNAEHHFTSFKSKFSKSY 

AC GCAACAAAAGAGGAACACGATTACCGATTCGGCGTG TTCA TCCACC 288 

ATKEEHDYRFGVFKSNLI KAKLHQ 

agaatcgcgatcctactgcg^gcacg^mtaactaagttc 3 60 

nrdptaehgitkfsdltasefrrq 



FLGLKKRLRLPAHAQKAP I LPTTN 
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ACCTCCCGGAGGACTTCGATTGGCGTXy^GAMGGCGCCGTTACTCCCXjTCAAGG 504 
L.PEDFDWREK GAVTPVKDQGSCGS 

CG TGTTGGGCG TTTAGCAC AACCGGAGCTTTAGAAGGAGCACACTATCTGGCTAC TGGGAAGCTTG TAAGCC 57 6 
CWAFSTTGALEGAHYLATGKLVSL . 

TTAGCGAACAACAGCTTGTGGATTGTGATCATGTGTCTC 648 
SEQQLVDCDH VCDPEQAGSCDSGC 

GTAATGGTGGATTX^TGAACAATGCTTTCGAATA 720 
NGG LMNNAFEYLLE SGGV.VQEKDY 

ATGCTTACACCGGAAGAGATGGCTCCTGCAMTTT^ 7 92 

AYTGRDGSCKFDKSKVVASVSNFS 

G TG TGG TTACCCTAGACGAAGACC AAATTGC TGC AAATCTAG TAAAGAATGGCCC TC TTGCAG TTGC TATT A 864 
VVTLDEDQIAANLVKNGPLAVAIN 

ATGCAC^CTGGATGCAGACATACATGAGTGGCGTCTCATGCCCATATGTCTGTX^ 936 
AAWMQTYMSGVSCPYVCAKSRLDH 

ATGGTGTTCTTCTAGTTGGTTTTGGAAAAGGTGCTTATGCTCCCATTCGATTGAAG 1008 
GVLLVGFGKGAYAPIRLKEKPYWI 

TCATTAAGA7VCTCCTGGGGGCAGAATTGGGGAGAGCAGGGATATTACAAGATCTGCAGAGGTAGA 1080 
I KNSWGQNWGEQGY YKICRGRNVC 

GTGGAGTCGATTCAATGGTTTCAACTGTAGCTGCAGCTCMTC^ 1152 
GVD SMVSTVAAAQSNH . 

AGTC TG TC TTC ATC TCATG TGAC TC TTAAG TTAGTTGAATT TG TG TAAAT ATATTATCATGAAG TTG TGAAT 1224 

GTATCrTCTGGTTATGGCAAAMCTATCTCTMGTAGGATCAAG^ 1296 

GTTGGTTATGTCTTGCTGTTACTGTGGTGATGm 1368 

GAAMTAATTATAAACTTTATTATTTTATGAGTTTATGGAACAAC^ 1440 

AAAAAAAGGAATT 1453 

Fig. 7B. 



nodulin-26 [17]. High amino acid sequence simi- 
larities were detected between amino acids 1 12 to 
122 (10 of 11 identical) and 239 to 250 (9 of 11 
identical) (regions underlined in Fig. 7A). Hydro- 
pathic plots revealed 6 stretches of hydrophobic 
amino acids which could correspond to mem- 
brane-spanning domains (Fig. 8A). 

The protein encoded by cDNA 15a had 41% 
Overall residue identity when compared to 
cysteine proteinase 1 from Dictyostelium 
(Fig. 7B). Amino acids 143 to 190 showed 76% 
sequence identity to the cysteine protease. The 



protein encoded by cDNA 15a also showed 
regions of 50 to 80% sequence identity with other 
thiol proteases (cathepsin H, papain, and 
aleurain) and contained conserved active site 
residues (Cys 154; His 300) [60, 66, 68]. Based 
on this analysis we tentatively identify the protein 
encoded by cDNA 15a as a thiol protease. A 
hydropathic plot of this protein revealed 2 long 
stretches of hydrophobic amino acids (Fig. 8B). 
The first is a 16 amino acid region immediately 
adjacent to a small basic N-terminal domain. 
Signal sequences in secreted proteins often show 
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AAriXICGCTAATCACAACACTTCGATMTCTCCCATAAAAGCCTCTCAAT^ 72 

CAGTTTCCAGAGAAATGGGTTCTGATAGCAACMTTTGGGATTCTT^ 144 
HGSDSNNLG FLKEIGLGAT N 

ACATCGGTTCTTTCATCMTCGCCAATGGAAAGCC^ 21 6 

IGSFINGQWKANGPTVHSVNPSTN 

ATCAGGTTATTGCTTCCGTGACTGAAGCAACTTTGGATGATTAT^ 288 
Q VIASVTEAT LDDYE EGLRASSEA 

CAGCAAAGACATGGAGGACTGTTCCGGCACCGAAAAGAGGTGAGATTG^ 360 
AKTWRTVPAPKRG £ IVRQIGDALR 

GGGCTMGTTGGATCCACTTGGTAGGTTGGTGGCTCrTGAGATGG 432 
AKLDPLGRLVALEMGKI LAEGIGE 

AGGTTCAGGAAATTATTGATATCTCTGATTATTCTGTTG 504 
VQEI IDMCDYSVGLSRQLNGSIIP 

CATCAGAACGTCCAGAGCATATGATGTTTGAGGTATGGAACCCATTAGGAATCGTO 576 
S ERPEHMMFEVW NP LGIV GVITAF 

TCAACTTTCCATGTGCTGTACTAGGATGGAATGCTTGCATTGCACTA^ 64 8 

NFPCAVLGWNACI ALVG G N T V V W K 

AGGGCGCGCCAACTACTCCATTGATAACTGTTGCTGT 720 
G A P T T P L I TVAVTKLIAEVFERNN 

ATTTACCCGGAGCAATATTTACrGCTCTCTGTGGGGGTGCTGATATTGGGCACG 792 
LPGAIFTALCGGAD IGHAI AKDTR 

GC ATTCC TTTGG TTTCATTTAC TGGMGCTCAAAGGTGGGCGCGCTGG TCCAGCAAAC AGTGAGTCAAAGAT 864 
I P LVSF TGSSKVGALVQQTVSQRF 

TTGGCAAMCCTTGCTTGAGTTGAGTGGTMCAATGC 936 
GKTLLELSGNNAI IVMDDADITLA 
-< 

C TG TGCGC TC TA TTTTCTTTGC AGC TG TCX^TACTGC TGGTCAGCGTTG TACAACC TGTC GTAGAC TGTATC 1003 
VRS I FFAAVGTAGQRCTT CRRLY L 

TGCATGAGAGTGTTTATGCAMCGTGCTTCAACAACTTACCGCA 1080 
HESVYANVLEQLTALYKQVKIGNP 

CTCTGGAGGAAGGGACACTAGTTGGGCCTCTGCATACTCGTTCTGCGGTGGAAAA 11 52 

LEEGTLVGPLHTRSAVENFKNGIS 

CTGCAATTAAATCTCAGGGAGGGAAGATTGTAACAGGGGGGTCTGTATTAGAGTC 1224 
AIKSQGGKIVTGGSVLESEGNFVV 

TGCCAACGATTGTTGAGATTTCTGCAGATCCTGCTGTAGTTAAAGAAG^ 12 96 

PTIVEI SADAAVVKEELFAPVLYV 

TTATGAAATTTMGGATCTTGAAGAAGCAATTGCCTTGAACAATTCCGTACCTCAA 1368 
M KFKDLEEAI ALNNSVPQGLSSSI 

TCTTCACACAAAAGCCTTCTACTATATTCAAATGGATAGGGCCAAGTGGA^ 1440 
FTQKPST IFKWIGPSGSDCGIVNV 

TCAACATACCAACAMTGGAGCTGAGATTGGAGGTGCCTTTGGTGGAG 1512 
NIPTNGAEIGGA FGGEKATGGGRE 

AAGCTGGAAGTGACTCGTGGAAGCMTACATGCGCCGTTCTACATC 1584 
A G S D SWKQYMRRSTCTINYGSELP 

CATTAGCTCAGGGMTTMCTTTGGCTAATTTTGAAGTTTGGTAG^ 1656 
LAQGINF G . 

TTTTCAAATGGTTGTTTCAACCAGATATGTTGTGAAAGTGGC^ 1728 

TGCCCAATTAATTGAGTGTTTGG 1751 

Fig. 7C 



a similar arrangement [29]. It has been reported 
thai aleurain, a thiol protease in barley, is secreted 
into vacuoles [58]. 

The sequence of cDNA 26g revealed an open 
reading frame which encoded a 508 amino acid 



protein (Fig. 7C). The deduced amino acid 
sequence had 30% residue identity with an alde- 
hyde dehydrogenase (aWDH) from Aspergillus 
nidulans [51]. Regions of higher sequence identity 
were scattered throughout the open reading frame 
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Fig 8 Hydropathy plots of the putative proteins encoded by 
cDNA 7a (A), cDNA 15a (B) and cDNA 26g (C). Positive 
values indicate hydrophobic regions. Plots were generated by 
the method of Kyte and Doolittle [35] using 6 amino acid 
intervals for comparison. 

(i.e., a.a. 46-55, 7/9; a.a. 183-196, 9/13; a.a, 
406-415, 7/9; a.a. 465-480, 10/15 identical 
amino acids) (regions underlined in Fig. 7C). 
Aldehyde dehydrogenases have a highly con- 
served sequence (VTLELGGKS) followed by a 
cysteine 28 amino acids downstream of this 
sequence [53]. The protein encoded by cDNA 
26g only had poor conservation of this sequence 
(TLLELSGNN) (underlined with an arrow in 
Fig. 7C) but did retain a cysteine (Cys 301) 29 



amino acids from this amino acid sequence. 
These data suggest that the deduced 508 amino 
acid protein is related to aldehyde dehydro- 
genases but has diverged in significant ways from 
this group of enzymes. Hydropathic plots of this 
protein revealed a hydrophobic domain of 21 
amino acids near the protein's N-terminus and a 
36 amino acid hydrophobic domain located 
between amino acid 152 and 188 (Fig. 8C). 



Discussion 

Exposure of plants to water deficit often results in 
inhibition of shoot growth, decreased photo- 
synthesis, osmotic adjustment and altered plant 
growth regulator levels [rev. 33, 67]. Some of 
these responses may be the direct consequence 
of reduced plant cell turgor. However, other 
responses involve changes in gene expression. 
Several authors have reported that water deficit 
causes a decrease in polysome content in growing 
tissues [1, 10, 32, 41, 42, 45, 63]. Decreases in 
polysome content and protein synthesis activity 
probably are global responses to reduced growth 
rates. In addition, specific changes in poly(A) 
RNA populations and protein synthesis are 
observed in plants exposed to low water 
potentials [2, 13, 14, 15, 24, 42, 45, 50]. Water 
deficit-induced changes in the abundance of 
selected transcripts and proteins suggest that 
specific biochemical responses of plants to 
dehydration involve changes in gene expression. 
For example, there is a correlation between seed 
desiccation, ABA accumulation and increased 
expression of specific genes [rev. 16]. The pro- 
teins encoded by this set of genes are hydrophihc, 
have repeating domains [8, rev. 16] and can be 
glycine-rich [21]. RNA hybridizing to these genes 
is not abundant in non-stressed plants but can be 
induced by dehydration [i.e., 8, 47]. The proteins 
encoded by these ABA-responsive genes have 
been proposed to protect cell structures from 
dehydration-induced damage [rev. 16]. In addi- 
tion to the ABA-responsive genes discussed 
above, other genes are induced by water deficit 
which are ABA-insensitive. For example, in- 
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creases in ABA which occur in wilted plants are 
dependent on nuclear gene transcription [ 17, 41] 
and several changes in poly(A) RNA population 
were observed prior to accumulation of ABA in 
wilted pea leaves [24]. Likewise, soybean 
seedlings exposed to mild water deficit 
( - 0.3 MPa vermiculite at 100% relative humid- 
ity) exhibited changes in poly(A) RNA popu- 
lations which could not be induced by exogenous 
ABA [10]. 

In this paper, the expression of 3 genes which 
show increased transcription and RNA levels in 
wilted pea shoots were characterized. ABA 
caused a small increase in RNA hybridizing to 
cDNA 7a but not to cDNAs 15a or 26g. These 
latter genes might respond to higher levels of ABA 
or to extended ABA treatment. However, the 
lack of ABA-responsiveness of these genes is not 
surprising because in a previous study, most of the 
poly(A) RNAs induced rapidly in response to 
shoot dehydration were not induced by ABA 
[24]. Furthermore, transcription of the genes cor- 
responding to cDNAs 7a, 15a and 26g was 
induced within 30 min following shoot dehydra- 
tion. However, increases in ABA in dehydrated 
pea shoots could only be detected after 30 min 
dehydration to zero turgor [24]. Finally, cyclo- 
heximide inhibits induction of ABA in wilted pea 
shoots [23] but this treatment did not prevent the 
increases in RNA hybridizing to cDNAs 7a, 15a 
and 26g. It was possible that the genes induced by 
plant dehydration would also be responsive to 
heat shock. In fact, one of these genes (corre- 
sponding to cDNA 7a) was induced when plants 
were exposed to 38 °C temperatures. Genes cor- 
responding to the other cDNAs (15a, 26g) were 
not induced by this treatment. This is consistent 
with previous studies showing that most of the 
changes in poly(A) RNA populations induced by 
dehydration of plants to zero turgor were not 
induced by high temperature [24]. More severe 
plant dehydration, however, may induce some 
heat shock genes [28]. 

Transcription assays using isolated nuclei 
showed that transcription of the genes corre- 
sponding to cDNA clones 7a, 15a and 26g is 
increased within 30 min of plant dehydration. 



Transcription of RNA hybridizing to 7a and 15a 
declined between 30 min and 4h after plant 
dehydration. However, the corresponding in vivo 
RNA levels of these genes did not decline suggest- 
ing that the RNAs are relatively stable. RNAs 
hybridizing to clones 7a and 15a also remain high 
when dehydrated seedlings are rehydrated for 
2 h. In contrast, RNA hybridizing to clone 26g 
declined when pea shoots were rehydrated. The 
increase in RNA hybridizing to clones 7a, 15a and 
26g was not prevented by pretreatment of pea 
shoots with cycloheximide. This indicates that the 
increase in RNA abundance is not mediated by 
proteins synthesized in response to dehydration. 

Sequence analsyis of the 3 cDNAs revealed 
that cDNA 7a encodes a protein with some 
sequence similarity to soybean nodulin-26, cDNA 
15a encodes a putative thiol protease, and cDNA 
26g a protein with some homology to aldehyde 
dehydrogenases. RNA encoding a thiol protease 
was reported to increase tomato in response to 
low temperature [60]. Shaffer and Fisher [60] 
noted that proteases could alter metabolism by 
increasing protein turnover rates or proteolyti- 
cally activating specific proteins. Another 
function which might be common to low tempera- 
ture and dehydration involves degradation of 
polypeptides denatured due to the stress. In addi- 
tion, the thiol protease could be involved in 
degrading vegetative storage proteins located in 
vacuoles. The mobilized amino acids would then 
be available for the synthesis of new proteins in 
response to stress or for osmotic adjustment. 
The homology between the deduced amino acid 
sequence of cDNA 26g and aldehyde dehydro- 
genases was interesting because betaine adehyde 
dehydrogenase and ABA aldehyde dehydro- 
genase are involved in responses of some plants 
to water deficit. However, pea plants do not 
accumulate significant levels of betaine [70] and 
a highly conserved amino acid sequence found in 
aldehyde dehydrogenases [53] was not highly 
conserved in the deduced amino acid sequence of 
cDNA 26g. Finally, the amino acid sequence of 
cDNA 7a showed some similarity to soybean 
nodulin-26 which is a peribacterioid membrane 
protein [17]. The protein encoded by cDNA 7a 
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had six potential membrane-spanning domains. 
This number of membrane-spanning domains is 
common to proteins which form K + , Na + or 
Ca 2+ channels [38]. The induction of ion chan- 
nels in response to reduced cell turgor would not 
be unexpected [30]. The high expression of this 
gene in roots vs shoots of control plants may also 
provide some clue to this gene function in well 
watered seedlings. 
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Abstract 

The identification of a cDNA (MR19) corresponding to a maize a-tubulin and homologous genomic 
clones (MG19/6 and MG19/14) is described. The cDNA has been isolated by differential screening of 
a cDNA maize root library. We have found two a-tubulin genes in a tandem arrangement in the genomic 
clones, separated by approximately 1.5 kbp. One of the genes (gene I) contains an identical nucleotide 
sequence which corresponds to the cDNA clone. The two deduced proteins from DNA sequences are 
very similar (only two conservative replacements in 451 amino acids) and they share a high homology 
as compared with the published a-tubulin sequences from other systems and in particular with the 
Arabidopsis thaliana and Chlamydomonas reinhardtii sequences reported. The structure of both genes is 
also very similar; it includes two introns, of 1.7 kbp and 0.8 kbp respectively, in each gene and only one 
intron placed at a homologous position in relation to Arabidopsis thaliana genes. By using specific 3' 
probes it appears that both genes are preferentially expressed in the radicular system of the plant. The 
a-tubulin gene family of Zea mays seems to be represented by at least 3 or 4 members. 



Introduction 

Microtubules are the basic components of the 
cytoskeleton. They play an essential role in 
eukaryotic cells in many processes such as cell 
division, internal transport, motility and morpho- 
genesis. As compared with animal systems plant 
cells present particular features that both restrict 
and give additional functions to microtubules. In 
fact, plant microtubules have specialized roles in 
mitosis controlling the plane of cell division and 
consequently the direction of cell elongation. In 
this sense microtubules are involved in determin- 



ing plant cell morphogenesis [ 16]. A particular set 
of plant microtubules, the cortical array, has been 
proposed to take part in the orientation of cellu- 
lose fibrils in the wall [27]. 

The major component of microtubules is tubu- 
lin, a heterodimer protein formed by two different 
subunits, alpha and beta, of approximately 
50kDa each. Both subunits share 40% of 
homology. In addition, a- and ^-tubulins have 
been extremely conserved through evolution. The 
majority of published sequences can be placed 
below 15% of divergence [26]. Tubulins (a and ft) 
are coded in vertebrates by complex multigene 



The nucleotide sequence data reported will appear in the EMBL/GenBank/DDBJ Nucleotide Sequence Databases under the 
accession number X 15 704. 



families. In these systems, members of the family 
are organized in a dispersed way. In man more 
than 20 a-tubulin genes have been described and 
approximately 15 genes have been characterized 
in mouse or rat [7]. Most of these are supposed 
to be pseudogenes [25]. 

In addition, several a-tubulin sequences have 
been published in lower eukaryotic systems. More 
than 15 a-tubulin genes have been described in 
Trypanosoma forming clusters, organized into 
tandem repeats of alternating a and jSgene units 
3.7 kbp long [24]. 

a-tubulin genes have also been characterized 
from various photosynthetic organisms such as 
Chlamydomonas reinhardtii [41] and Volvox car- 
terii [30]. These two species present two unlinked 
genes in each case. However, in comparison with 
the abundant information available for genes cod- 
ing for a-tubulin in animal systems, very few data 
exist from the plant kingdom [42]. Tubulins have 
been purified from plant cells by using their prop- 
erty of in vitro assembly into microtubules [32]. 
Protein data have indicated the existence of mul- 
tiple isotypes in tissues of the higher plant 
Phaseolus vulgaris [21 ], and monocotyledon endo- 
sperm cells [35]. Only the a-tubulin gene family 
from Arabidopsis thaliana has been extensively 
characterized so far in higher plants [28]. In this 
species four genes coding for this protein have 
been detected organized in a dispersed form in the 
genome. A possible cluster of alternating a//? 
tubulin genes has been proposed from mung bean 
[38], but no sequence data from cloning studies 
are yet available. - 

The existence of different tubulin genes in a 
given organism may correspond to the need of a 
differential expression in defined tissues or devel- 
opmental stages. Data from the literature indicate 
that in some cases different genes are expressed 
in distinct tissues or developmental stages. In rat 
the presence of a specific cerebellum isoform of 
a-tubulin has been described [ 14] while in mouse 
two specific genes are expressed in testis [49] and 
an unusually highly divergent testicular isoform of 
a-tubulin, not found in brain, has been described 
[20]. Constitutive and tissue-specific isoforms of 
a-tubulin have been observed in Drosophila 



melanogaster [48]. In contrast with the hypothesis 
of different a-tubulin genes present in functionally 
different microtubules, data from two yeast 
species studied, Saccharomyces cerevisiae and 
Schizosaccharomyces pombe, show only two genes 
for a-tubulin in their genome, only one of which 
seems to be essentially required whereas the other 
one is dispensable [43, 1]. 

As has been shown in animal organisms, plant 
tubulin genes may be interesting markers to study 
developmental processes occurring in plants. 
More complete information available for a plant 
system comes from Arabidopsis thaliana, where 
one of the genes seems to be specifically expressed 
in flowers as compared with leaves and roots 
[29]. Recent reports based on protein data indi- 
cate the existence of specific patterns of tubulin 
polypeptides in different organs of carrot [22]. A 
developmental control of the expression of tubulin 
proteins during somatic embryogenesis in cul- 
tured carrot cells has been described [9]. 

In the present paper we show that a-tubulin 
genes are cloned in Zea mays by a family of genes 
that includes a subgroup of two genes forming a 
tandem array. Both genes have a preferential 
expression in the radicular system. The structure 
of the proteins is compared with other known 
systems showing a high level of homology. The 
novel organization of a-tubulin genes in this plant 
species is discussed, as well as in different 
varieties of the maize group. 



Materials and methods 

Plant material 

All our studies were done with Zea maysL. 
(inbred line W64A) grown under greenhouse con- 
ditions. Plantlets were obtained by germinating 
dry seeds through imbibition in water at 25 °C in 
the dark for the indicated period of time. 

General methods 

DNA ligation, bacterial transformation (Escheri- 
chia coli DH5aF strain) and cloning procedures 



followed published methods [ 17]. Harvesting and 
preparation of lambda-phage DNA were carried 
out according to described protocols [ 10] as well 
as preparation of plasmid DNA, restriction 
enzyme analysis, agarose gel electrophoresis of 
nucleic acids and blotting [31]. Vectors used in 
subcloning and sequencing procedures were 
pUC18 and M13mpl8/19 [50]. 

Enzymes were purchased from Boehringer 
(Mannheim) unless stated otherwise, labelled 
nucleotides were from Amersham. 

RNA isolation and poly(A) + RNA selection 

20-40 g of plant material were frozen in liquid 
nitrogen and ground to a fine powder in a mortar. 
For the construction of cDNA libraries the 
guanidinium isothiocyanate method was used 
[31]. For preparing single-strand cDNA probes 
and Northern blot analysis the RNA was ex- 
tracted according to described methods [11, 45]. 
Poly(A) + RNA was selected on oligo(dT)-cellu- 
lose chromatography. 

cDNA library 

A AgtlO cDNA library was prepared from 
poly(A) + RNA of Zea mays roots two months 
old, essentially according to described protocols 
[15], with modifications. 

1) For first- strand cDNA synthesis 6 /xg of 
poly(A) + RNA was denatured by incubation at 
70 °C for 5 min and the reaction carried out in a 
volume of 55 /xL containing 50 mM Tris-HCl 
pH 8.3 (at 42 °C), 10 mM MgCl 2 , 10 mM DTT, 
1 mM dATP, 1 mM dCTP, 1 mM dGTP, 1 mM 
dTTP, 120 mM KC1, 25 u of RNasin (Stehelin), 
5.5 /xg of oligo-d(T) 12 " 18 and 20 u//xg RNA of 
Super Reverse Transcriptase (Stehelin). The mix 
was incubated at 42 °C for 1 hour. 

2) For second-strand cDNA synthesis up to 
1 /xg of single-stranded cDNA {i.e. 2 fig of hybrid) 
was processed in 200 /xl of 20 mM Tris-HCl 
pH7.5, 5mM MgCl 2 ; 10 mM (NH 4 ) 2 S0 4) 
100 mM KC1 150 /xM NAD, 200 /xM dATP, 
200 /xM dCTP, 200 /xM dGTP, 200 /xM dTTP, 
50/xg/ml BSA, 2.7 u RNase-H (Stehelin), 50 u 



DNA Polymerase I (Stehelin), and 5 u E. coli 
DNA ligase (Biolabs). Incubations were sequen- 
tially 2 h at 15 °C and 1 h at 22 °C. . 

3) The cDNA was methylated with Eco RI 
methylase (Biolabs), the ends repaired with 
T4-DNA-Polymerase I (Klenow, Biolabs), and 
the cDNA ligated to synthetic Eco RI linkers 
(10-mer, Biolabs). The cDNA attached to the 
linkers was cleaved with Eco RI and fractionated 
by agarose gel electrophoresis according to 
length. cDNAs between 500 bp and 3.5 kbp were 
inserted into the unique Eco RI site in the cl gene 
of the AgtlO [23]. The DNA was packeged in vitro 
[31] and plated onto C6Q0 Hfl E. coli K 12 cells. 
2/xg of poly(A) + RNA yielded 1.2 x 10 6 inde- 
pendent clones of Acl recombinants. 



Differential screening of the cDNA library 

Single-strand cDNA probes were obtained with 
poly(A)* RNA from 2-month-old roots and 
leaves of maize by a modification of the method 
described previously [46]. The reactions were 
carried out in a volume of 25 /xl containing 1 /xg of 
poly(A) + RNA of each organ separately; 1.4 /xg 
of oligo d(T) 12-18 (both prewarmed 3 min at 
65 °C and cooled on ice); 50 mM Tris-HCl 
pH8.3 (42 °C), 10 mM MgCl 2 , 10 mM DTT, 
0.5 mM dATP, 0.5 mM dCTP, 0.5 mM dTTP, 
0.03 mM dGTP, 50 /xCi 32 P-dGTP 
(3000 Ci/mmol, Amersham), 30 u RNasin 
(Genofit) and 10 u of Reverse Transcriptase 
(Genofit). The mixes were incubated at 42 0 C for 
45 min and unincorporated 32 P-dGTP separated 
bychromatographyona0.7cm x lOcmSephadex 
G-50 column. Three plaque filter replicas of 5 
plates of the cDNA library (3000 pfu/plate) were 
prepared. Replicas were obtained by transferring 
each.one during 2, 4 and 8 min onto nitrocellulose 
paper. The first and third filters were hybridized 
with single- strand cDNA probes from roots, and 
the second one with corresponding probes from 
leaves at a specific activity of ca. 1.5 x 10 7 cpm//xg. 
Hybridization was performed at Tm-25 with 
0.6 - 1 x 10 6 cpm/ml solution for 24-36 hours, in 
the presence of 50% formamide, and final washes 



(30 min each) were done in 0.2 x SSC, 0.1% 
SDS at 65 °C. 



Northern blot analysis 

10 /zg of total RNA and 0.5 /ig of poly(A) + RNA 
were fractioned in 1.5% agarose-formaldehyde 
gels [3], transferred to either nitrocellulose 
(BA85, Schleicher & Schuell, 0.45 /im) or nylon 
membranes (Hybond-N, Amersham) and hybrid- 
ized according to protocols suggested by the 
suppliers. Probes were labelled by nick-trans- 
lation [39] or random-primed (Boehringer- 
Mannheim kit) reactions [ 12] to a specific activity 
up to 10 9 cpm/^g. 



Southern blot analysis 

Isolation of DNA was performed following proce- 
dures described previously [4] with variations. 
20 g of plant material were frozen in liquid nitro- 
gen and reduced to powder in a mortar. 4 ml/g of 
extraction buffer (0.35 M NaCl, 10 mM Tris-HCl 
pH 8, 0.1 M EDTA, 1% Sarcosyl, 7 M urea and 
5% phenol) was added. The mixture was incu- 
bated for 2 h at 37 °C under gentle shaking. After 
phenol/chloroform extraction, DNA was ethanol- 
precipitated and purified through a CsCl gradient. 

10 ixg of genomic DNA digested with restric- 
tion enzymes were fractioned in 0.8% agarose 
gels and blotted onto nylon membranes 
(ZetaProbe, BioRad). Hybridizations were done 
in: 1.5 x SSPE (1 x SSPE is 180 mM NaCl, 
10 mM NaH 2 P0 4 pH 7.7, 1 mM EDTA), 1% 
SDS, 0.5% powered skimmed milk (Molico, 
Nestle) and 0.5 mg/ml of denatured salmon 
sperm DNA at 65 °C for 16-20 h. Washes were 
performed also at 65 °C up to 0.1 x SSC, 1% 
SDS. 



DNA sequence analysis 

Reactions were carried out using the M13- 
dideoxy nucleotide method according to de- 



scribed protocols [40] with modifications in order 
to use 35 S-labeled nucleotides [2]. Klenow DNA 
polymerase (New England Biolabs) and the 
T7-Sequencing Kit (Pharmacia) were used. The 
products obtained were separated in 6% acryl- 
amide 8 M urea 02-0.45 mm wedge gels. Se- 
quence alignment and analysis was done by using 
software from the CITI2 database (Paris) and 
Micro-Genie-(Beckman) [37]. 



Screening of the genomic library 

The screening of a ACh35 genomic library of 
maize [13] was carried out with cDNA clones 
labelled by random priming (Boehringer- 
Mannheim kit). Hybridization conditions of 
nitrocellulose replicas were: 10 6 cpm/ml, 5x SSC, 
SxDenhardt's solution, 50% formamide, 5 mM 
EDTA, 25 mM sodium phosphate, 0.1% SDS 
and 250 /xg/ml of denatured salmon sperm DNA, 
at 42 ° C for 24 h. The final two washes were done 
in 0.2 x SSC, 0.1% SDS at 65 °C for 15 min 
each. 

Results 

cDNA cloning and sequencing 

A group of cDNA clones corresponding to maize 
a-tubulin was found in the course of a differential 
screening of a maize root cDNA library using 
labelled single-stranded cDNA probes synthe- 
sized from root and leaf poly(A) + RNA. The 
library was contructed in Agt 10 and 15 000 recom- 
binant clones were screened. From those giving a 
preferential signal with root RNA, a number were 
selected to confirm their predominant expression 
in the radicular tissues by means of a northern 
analysis. One of these clones (MR 19), giving a 
7-fold increase in root DNA as compared to leaf 
(see Fig. 1 A), was one of those chosen for further 
analysis. The estimated size of the mRNA was 
approximately 1750 nucleotides. 

When northern analysis was carried out with 
RNA extracted from different parts of the maize 
plant the preferential expression of MR 19 in the 
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Fig. 1. A. Northern blot analysis of maize adult root and leaf 
RNA. Total RNAs (10 /ig) isolated from adult (60 days after 
germination) roots (1) and leaves (2) and corresponding poly 
(A) + RNAs (0.5 /ig) fraction from roots (3) and leaves (4) 
were probed with the insert of MR 19 clone (cDNA). The 
mobilities of 25 S, 18S and 16S ribosomal RNAs are indi- 
cated. The estimated length of the transcript (in kilonude- 
otides)is shown. B. Northern blot analysis of MR 19 a-tubulin 
in maize in different organs and several stages of develop- 
ment. The following RNAs were probed with the insert of 
MR 19 (cDNA) clone. Polysomal RNAs, membrane-bound 
(2 fig) purified from endosperm 20 days after pollination (1); 
total RNAs (10 fig) extracted from embryo 20 days after 
pollination (2); young roots two (3), four (4) and seven (5) 
days old; root tip from seven-day-old young roots (6); adult 
root (7); coleoptiles two (8), four (9) and seven (10) days old; 
upper part of coleoptile (including node) (11) and seven-day- 
old shoots (12) and adult leaves (13). 

radicular tissues was confirmed. The northern 
analysis of RNA extracted from endosperm and 
embryo 20 days after pollination, adult (two 
months old) roots and leaves, and different parts 
of different ages of the young roots and coleoptiles 
is presented in Fig. IB. It appears that, while 
RNA hybridizing with this cDNA is poorly repre- 
sented in leaves and shoots, the highest amount of 
mRNA is found in the meristematic part of the 
young roots, but it is also found in adult roots. In 
general, it appears that the mRNA is especially 
abundant in zones of the young plant rich in 
meristematic regions, such as young roots and 
coleoptiles, with a special increase in the radicular 



tissues. Although there is a good hybridization 
signal in coleoptiles, a clear decrease of hybridiza- 
tion in adult leaves is observed as compared with 
adult roots. Expression is also detected at a lower 
level in embryo and endosperm. 

Sequencing of the insert was carried out with 
the dideoxy method. The nucleotide sequence and 
the protein sequence deduced from it are shown 
in Fig. 2 (between asterisks at positions 6530 and 
8210, excluding introns), together with the 
MG19/6 and MG19/14 genomic clones (de- 
scribed below) in an overlapping form. Searching 
in the EMBL databank revealed a strong homol- 
ogy with Chlamydomonas renhardtii a-tubulin 
[41], and subsequently with all the other pub- 
lished sequences coding for a-tubulins. As could 
be expected from the size of the cDNA insert 
(1483 bp), as compared with the estimated 
mRNA length, the protein sequence deduced 
lacks the initial part of the N-terminus of maize 
a-tubulin. The non-translated 3' end of the 
mRNA is 280 nucleotides long and has two possi- 
ble polyadenylation signals at 27 and 104 nucle- 
otides before the beginning of the poly(A) tail (at 
position 8210) as shown in Fig. 2. Both sequences 
(AATAAT and AGTAAA) depart from a single 
nucleotide in the consensus sequence 
(AATAAA). 

Genomic cloning and sequencing 

The cDNA insert of the MR19 clone was used to 
screen a genomic library contracted in ACh35 by 
partial digestion with Mbo I of endosperm DNA 
from the maize inbred line W64A [13]. Eleven 
clones were found positive, and two having a 
strong hybridization signal (MG19/6 and 
MG19/14) were used for further analysis. Clones 
MG19/6 and MG19/14 were found to overlap, 
MG19/14 being longer at its 3' end and therefore 
containing the whole cDNA sequence (see 
Fig. 3). The sequence of the overlapping frag- 
ments showed that they corresponded to identical 
genomic sequences. 

From the restriction analysis and partial se- 
quence (8.5 kbp) of the insert of the genomic 



1 gtcgacgccccggtacgaagtgggtg^ 
121 aaaatagttadgat^aaagagagaalaaaatt^ 
241 cacatsgwccgtgtatgcacaawW 
361 taggccgagtcctgtaattttgtgacatutttt^ 
tgaagcgaaaaatatcataaattcataacgaac^^ 
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3121 
166 

3241 
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3361 



3841 
376 

3961 

4061 
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iccacacttccttcgccggctcatggtg^ 

721 guttcgtactcgcctatctccgwct^^ 

841 OXGGTATCCAGGTCGGAAACXEGTGC 

f£ AO I QVGNACWELY.CLEHG I Q 

961 atgggacttccgtcgct^tgcgtgctgta^ 

1061 cgcgtttgtcaggtctgatccgtatatatggt^^ 

1201 tttccatgatccaatgaatttagtagttatgtat^ 

1321 aattttcataggccMgttcctattgcatgcctt^ 

1441 tccaacaaggagctctaaaatagagccctaaaagtt^^ 

1561 agactaagaggaaagsaggatagggctccaa^^^ 

1661 ataatcccacaccctatatttcaaaatacagcct^ 

1601 ttttgaaatagtggatgtgataggagatagaattg^ 

1921 ccttgacccaattgccttttgctgtttagatcW 

2041 ttcacatatccgaagtgutgcaatcatggatttaa^ 

2161 cgggctctaatcctattcautttgaattaaaattantaag^ 

2281 tccattcaggcatgtgttaccttaagattatgcgtc^ 

2401 ctcccagittttaagaaaclggttlatagaatta^ 

2521 tctctcctagctaaaagataaaaaccagtttcttaaaa^t^ 



A D G Q M 



2641 OCGGTGACAAGACCATTGGGGGTGGTGATGATCC^ 

J^PGDKTIGGGDOAFNTFFSETGAGKHVPRAVFVDLEPTV I D 

2761 AGGTGAGGWCTGGCAOCTAOCGXAGCTCTTOCACOCTG 

77EVRTGTYRQLFHPEQL I SGKEDAANNFARGHYT 

2881 tcagtttacaatgctcctggamttgctttgtaatgaacatct^^ 
110 ^ ^ ^ ^ ^ ^ ^1 GKEIVDLCLDRIRKL 

3001 aG*CAACTQC*CT3GTCT0^^ It* I lOOjUXiaiGGAOCGTCIGICIGl IWCT^COOCAAGAAGTOCA 



DNCTGLQGFLVFNAVGGGTGSCLGSLLLERLSVDYGKKS 
ACCTTGGTTTCACTGTGT^CCCATOXOOCM 

KLGFTVYPSPQVSTSVVEPYNSVLSTHSLLEHTDVAI LLO 
ACGAGGCCATCTATG^TCTGCCGGCGTTCCC 

NEAI YDICRRSLDI ERPTYTNLNRLVS Q 
ggtctgtmctgggcuuctctttaaccatcagttgtt^ 

V I SSLTASLRFDGALNVOVNEF 



3481 (XXGKpWTKGT^ 

256 QTNLVPYPRIHFHLSSYAPVI SAEKAYHEQLSVAE I TNSA 

3601 CTTCGAGC^TOTOCATGATGGCCM 
296 FEPSSMMAKCDPRHGKYMACCLMYRGDVVPKDVNAAVATl 

3TC1 CAAGA^OCGCA^]a^GTTTGT(Xy 
336 KTKRTIGFVDWCPTGFKCG I NYGPPSVVPGGDLAKVGRAV 

GT^raiCTOCA^TOCA^CTCTTGTG^ 
CMI SNSTSVVEVFSR I DHKFDLMYAKRAFVHWYVGEGMEE 

GGGCGAGTTqCTGAGGCO^^ 

GEFSEAREDLAALEKDYEEVGAEFDEGEEGDDGDE Y 
tgatgcggtcatx^tcaggcttgtgtgctgctcttgtccccgttgtggtttgcaacacct^ 
cgttatggatggttgtcta^ctacaUattgcttctcgamtggaaa^^ 
4321 9cctgtcaagcalcaggattgcgttatgagt 
4441 aitawmx^tcciaaj^wtuttatca 
4561 acaagtga^c^taaatacctaa^ 
4661 gc^taaaUccacaataattttttggtggcaccamttt^ 
4801 tcacgtmtaeaatttaaaactatcaactaaaattgtctaatggat^^ 
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4921 agaa^tactaaaggttadgaatttaggtacacttacgactag^ 

504 1 tcctcwttuttaactcccaaactatcccttatt^ 

5161 taatgccctgcaaggtttcgautccgtagc<^tara 

5261 aatctcaggggctcaaacgaaaaaacaccgccccctt^cctcgcttggctggcactccat^ 

5401 taaccctaattgagcatt^cagagataggcgta 

5521 cATGAGGGAGTOCATTTOGATOCV^ 

5M1 ttctccctctgttg^tagatctataatcggtatggOT^ 

5761 cggatctgsatgcatgtaatggtggctaccggatccgggtttgctaggtctgatccgg^ 

5681 gaatagggatcacccgaaacccac^tgcgtggtagatctgagtgcccagaatw 

eooi atgtgagccttgatgtacaaatacagactatagt^ 

6121 aattctgtaaaaaaaa<^tacacgt(actctta^ 

6241 ttattagctttatctaggactaaatgctgttcacatatttc^ 

6361 ggcctgtgttaccttaagatcatgcatctgtctggtctacta^ 
32 

6481 GTCACATGOOCGG7WCMCAOCATTXXX3GGA 

34 GQMPGOK T I GGGDOA FNTFFSE TGAGKHVPRTAVFVOLEPT 

6601 TOTOATGAGCTGAGA/C^^ 

74 VtOEVRTGTYRQLFHPEQL I SGKEDAANNF ARGHYT 

6721 atctgagcgtcagttttcactgctccttgatattw 

^ ^ I I V U L \* L U K I 

6™ ^^GCTTGG^ 
<=5? LLDNEA I YOICRRSLDI ERPTYTNLNRLVSQ 

7201 tttcgcattttgcctggtaaactgg^^ 

^ WW 

7681 TGG0CMGGTGCA(X^GTGCCGTCTGCATCA7C 
J60 LAKVQRAVCM I SNSTSVVEVFSRI DHK FDL MYAKRArVHW 

7801 ACGTCGGTCAGGGTATGGACXjMGGTGAGTTCTC 
406 YVGEGMEEGEFSEAREOLAALEKDYEEVGA EFOEGEDGyt 

7921 GTOCCACTACtagagaagtttgc^ 

446 G D E Y M)m 
8041 gtgtggtttacaacacctgatgtt gtaa^ 

ei 61 ccaccttgttcatcttttgtcatxa^ 

8281 tacgatc 

Fig. 2. Nucleotide sequence of the tandem a-tubulin genes in Zea mays. The sequence (8287 nt long) corresponds to MG19/6 
and MG 19/14 overlapped genomic clones as showed in Fig. 3. The sequence is numbered beginning from a Sail site marked 
with an asterisk in Fig. 3. Sequence of the MR 19 cDNA clone is indicated between asterisks above positions 6530 and 8210, 
excluding introns. The deduced coding sequences are presented in upper-case letters. The predicted amino acid sequence of 
a-tubulin I and II (single-letter code) is shown below the corresponding nucleotide triplets. The putative TATA boxes ( = ) and 
polyadenylation signals ( + ) are marked above the indicated nucleotides. The intergenic repetitive sequence shown in Fig. 4 lies 
between positions 4373 and 4693 denoted with points above nucleotides. 



clones it appeared that the genomic fragment 
contained two regions homologous to the MR 19 
cDNA probe. Sequence data indicated that two 
different units coding for a -tubulin, genes I and II, 
were present. The restriction map of the genomic 



insert and the location of the coding sequences 
and introns are shown in Fig. 3. The 3' region 
(gene I) contains a nucleotide sequence identical 
to the cDNA, except for the presence of three 
insertions flanked by the consensus sequences of 
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Fig. 3. Restriction maps of genomic clones MG19/6 and MG19/14 showing two a-tubulin units, genes I and II. In both genes 
white boxes represent introns and black ones the corresponding exon coding segments. The putative TATA boxes, ATG and 
stop signals are indicated. The map of the MR 19 cDNA clone is also shown. The dotted box located between genes represent 
a zone rich in repetitive sequences (see Fig. 4). The sites presented here are marked as follows: H (Hind III), E (Eco RI), S (Sac I), 
X (Xho I) and B (Bam HI). Arrowed lines marked with A, B and C show probes used in experiments described in Fig. 5 and 6. 
The nucleotide sequence presented in Fig. 2 lies between a Sal I site (marked with an asterisk) and the 3' end of the MG19/14 

genomic clone. 



introns. The central region of the genomic clone, 
hybridizing with the MR 19 probe, resulted in 
another gene (gene II) coding for an a-tubulin 
protein of an identical protein sequence as com- 
pared with the other one, except for two conserva- 
tive replacements (Glu/Asp, Asp/Glu) in the 
hypervariable acidic C-terminus of the protein 
(see Fig. 7). 

The nucleotide sequence of the two genes is 
very similar but clearly not identical. They show 
95% homology when only the translated se- 
quences are considered. Among 1353 translated 
nucleotides in both genes we have found 72 nucle- 
otide replacements while only 2 (as mentioned 
before) result in amino acid changes. 

The sequence of the unit of tandem genes, 
resulting from MG19/6, MG19/14 and MR19 
clones, is presented in Fig. 2. Genes I and II are 
interrupted by three introns. The first arid third 
introns are located between amino acid positions 
31/32 and 233/234, respectively, whereas the 
second one is found between the first and second 



base of triplet coding for amino acid 110 of the 
protein, the same position of the second intron of 
A. thaliana a-tubulin genes [42]. In particular, 
although their position is identical in the two 
genes, the length and the sequence of the introns 
are quite different. The first intron is 859 bp long 
in gene I and 1724 bp in gene II, the second one 
is 93 bp long in both cases, and the third one is 
105 bp long in gene I and 92 bp long in gene II. 
The six introns present consensus 5' (GT - ) and 
3 ' ( - AG) recognition signals as reported in the 
literature [18]. 

Both genes have consensus TATA boxes at 
approximately the same distance from the initial 
ATG, which is - 149 and - 152 for gene I and II 
respectively. The nucleotide positions surround- 
ing the putative TATA boxes have been con- 
served showing a high degree of homology. When 
the 5' end of the coding region is compared with 
the a3-tubulin A, Thaliana sequence [28], good 
homology is found between the transcriptional 
starting point for gene a3 of Arabidopsis and the 



ACGTTTAAACTTGTCTAAGGTGCCATCTAGATCATGAACTTGTCAAGG 
<fl ^^mamgnin fo <ft rm m 



GTTGCCACTTAGATCATGAACTTCGTAAATATGTTTTTGGATCCAAAA 

» < i > o 



TATGI IM1 IATCCTTAAGGGTGTGTTTGTGTGTTTGGTTGAATGTATA 



AGAAGGGATGAAAGAGGAATGTCATAATTTCTATAGTGTTTGGTTGAG 



AGACAAGTGAGGACGAGATAAATACCTAAGAAGGGATGAAAGAGGAA 

§><0=C> 



TGCCACAATTTCTATAGTGTTTGGTTCAGAGACAAGTGACAATTTCTA 
^TU.— , - , — J N^^rf* ! — . . — . ..^ 



TAGTGTTTGGTTGAGAGACAAGTGAGGGCGAGTAAATACCGCAATAAT 

5 > < > 



4. Repetitive sequences found in the intergenic region of 
the MG19/6 genomic clone. Equal patterns of duplication are 
shown with homologous filled arrows. The Bam HI site is 
presented to locate the region in the genomic map (see 
Fig. 3). 

maize genes. This point is situated at 34 bp 
(gene II), and 35 bp (gene I), downstream from 
the putative TATA boxes. Both genes have TAG 
as stop codon and present possible polyadenyla- 



tion signals (shown in Fig. 2) placed, as men- 
tioned before, in gene I and a single one at 120 
nucleotides from the stop codon in gene II. 

The intergenic region is 1452 bp long, from the 
stop codon of gene II to the ATG of gene I. In this 
zone homologies between the two genes are found 
in the transcribed 3' and 5' ends and within an 
approximately 200 bp region surrounding the 
TATA box. An especially interesting sequence is 
found in a zone (marked with a dotted box in 
Fig. 3 and denoted with dots in Fig. 2) that seems 
to separate the two genie regions. A number of 
repetitions not observed in other parts of the 
sequenced fragment is found. Repeats from 12 up 
to 60 bp long can be observed in a variable num- 
ber up to 4 times. This sequence is shown in Fig. 4 
with the indicated repetitions. 



Gene structure and differential expression 

The proteins encoded by gene I and II slightly 
differ in their C-terminus. the 3' untranslated 
region has also marked differences between the 
two genes. As has been done in other cases using 
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Fig. 5. Southern analysis of genomic a-tubulin sequences homologous to the MR 19 cDNA clone of Zea mays. Southern blots 
were prepared from Zea mays genomic DNA digested with Xho I (X), Sac I (S), Hind III (H), Eco RI (E) and Bam HI (B) and 
hybridized with A, B and C probes respectively (see Fig. 3). Probe A corresponds to the most conserved part of a-tubulin 
sequences. Probes B and C correspond to the hypervariable and specific carboxyl-terminus protein sequences and the 3' ends 
of genes I and II. The same filter was reused in order to allow a better comparison of the observed signals. D. Southern blot 
analysis of a-tubulin genes in several related cereals. DNAs were extracted from Sorghum bicolor(\\ Zea diploperennis (teosinte) 
(2), an Fl from Zea diploperennis x Zea mays (Palomero Toluqueno) (3), Zea mays inbred line A188 (4), Zea mays inbred line 
W64A (5) and Zea mays Black Mexican Sweet (6). All these DNAs were diggested with Eco RI and hybridized with probe A 

(see Fig. 3). 
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3 > specific probes, it could be possible to study 
both the genomic structure as seen in Southern 
analysis and the expression of the two genes by 
northern blots. The Southern blot using a probe 
that contains the nucleotide region coding for the 
most conserved domain of the protein (probe A in 
Fig. 3) gives the pattern shown in Fig. 5A. 

At this level of stringency two types of bands of 
different intensity may be observed (see, for 
instance, the Hind III lane). Using 3' specific 
probes (probes B and C in Fig. 3) a pattern giving 
rise to single Southern bands is obtained (Fig. 5B 
and C). By comparing the three blots it is possible 
to interpret the pattern of intense bands obtained 
with the probe of the invariable zone and, with the 
help of the two other probes, to attribute them to 
either gene I or gene II (see Fig. 3). 

The bands observed in the Southern blots using 
3' probes correspond exactly to the genomic 
restriction fragments of the cloned segments as 
presented in Fig. 3, except where sites outside the 
clones are involved. The Xho I lane shows three 
bands when probe A is used. The lowest and most 
intense one corresponds to both gene I and 
gene II (see restriction map in Fig. 3), giving a 
higher intensity of hybridization. The bands hav- 
ing a lower mobility and intensity are supposed to 
correspond to other members of the family of 
a-tubulin genes. Sac I lane gives only two bands 
using probe A. The larger one would match per- 
fectly to both corresponding genomic fragments 
of genes I and II, as can be seen when probes B 
and C are used, whereas the smaller one will be 
the sequence homologous to other gene(s). The 
Eco RI lane shows the simplest pattern since 
using probe A only two bands appear. Each one 
can be directly related to gene I or gene II when 
specific probes are used. The Hind HI lane gives 
two bands of higher intensity together with two or 
three smaller and fainter bands. The first and 
second bands account for the corresponding 
genes (I or II, observing the signal when specific 
probes are used). The rest of the bands corre- 
spond to the other less homologous members of 
the family. A similar behaviour can be observed 
in Bam HI lanes. 

The genomic structure of the corresponding 



genes of a-tubulin in different varieties of maize 
and related species has also been studied. In 
Fig. 5D the Southern blot of DNA from Sorghum 
bicolor, Zea diploperennis (teosinte), and Fl gen- 
eration resulting from Zea diploperennis x Zea 
mays (Palomero toluquefio) and three inbred lines 
of Zea mays (A188, W64A and Black Mexican 
Sweet) is shown. Although polymorphisms are 
observed at the genomic level, the number of 
genes homologous to those here studied appear to 
be similar in all these plants. 

The same probes used for the Southern analy- 
sis were used to measure the expression of the two 
genes in different parts of the plant. The pattern 
of distribution of the mRNA using the probe for 
the conserved region (Fig. 6A) or the 3 ' probe of 
gene I (Fig. 6B) is quite similar. When the 3' 
probe of gene II is used (Fig. 6C) some dif- 
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Fig. <5. Analysis of mRNA accumulation of a-tubulin genes I 
and II. Northern blots were prepared with total RNAs 
extracted from, two-day-old coleoptiles (1) and young roots 
(2); poly(A) + RNA (0.5 fig) from adult leaves (3) and roots 
(4); total RNAs (10 ^g) from adult leaves (5) and roots (6). 
The filters were hybridized with probes A, B and C, respec- 
tively (see Fig. 3) and with histone H3 maize probe (H). 
Filters were washed and reprobed for a better comparison of 
results. In each case the same time of exposition (7 days) was 
used. 
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ferences can be observed. In particular, the 
expression of this gene seems to be only confined 
to young plantlets with preferential expression in 
young roots. 

Parallel experiments were carried out with 
maize histone H3 [6] (shown in Fig. 6) and H4 
[34] (data not shown), as probes for detecting 
expression in organs rich in dividing cells. The 
results indicated that a very similar pattern of 
mRNA accumulation of these two histone genes 
is obtained in organs rich in meristematic regions, 
such as young roots and coleoptiles, while a dif- 
ferent pattern is observed when probe A, B or C 
are used. Autoradiographs of northern blots were 
scanned in a densitometer to obtain relative 
quantification of bands in the different lanes. 
When the signal of histone H3 is taken as a 
reference, the expression of gene I in radicular 
tissues is 9 times more intense than in the 
coleoptile, and there is a 6-fold increase in gene II 
indicating a preferential expression of gene I 
(and II) in the radicular system of Zea. mays. 

Discussion 

A tandem repeat of a-tubulin genes has been 
cloned in maize as a result of a differential 



screening of a root cDNA library. The two genes 
code for two almost identical proteins with only 
two conservative (Asp/Glu and Glu/Asp) replace- 
ments in the hypervariable C-terminus of the 
protein. The sequences are more variable at the 
nucleotide level but keep 95 % homology between 
the two genes. 

When the protein sequence of maize a-tubulin 
is compared with that of other known a-tubulin 
sequences, a high degree of similarity is found. In 
Fig. 7 the two maize sequences are compared 
with the two a-tubulin sequences known from 
A. thaliana, al [29] and a3 [28], with the uni- 
cellular algal sequence a 1-tiibulin form Chlamydo- 
monas reinhardtii [41], and with the kal-tubulin 
from man [8]. The homology between two organ- 
isms as diverged as Homo sapiens and Zea mays 
yields a remarkable level of homology (83.4%). 

Comparison of the protein sequences demon- 
strate that the maize sequences a 1- and a2-tubulin 
corresponding to genes I and II, respectively, 
share an 99.6% homology. In addition, the 
highest similarity is found between the Chlamydo- 
monas protein and a 1 -maize (91.8%) or a2-maize 
(91.6%), a degree of homology exceeding that 
found with a3 protein from Arabidopsis (91.2% 
for al-maize and 90.7% for a2-maize). Homology 
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Fig. 7. Comparison of a-tubulin protein sequences between different organisms. The predicted amino acid sequence of the 
a 1 -tubulin of Zea mays (coded by gene I) is presented at the top line of single-letter code. The predicted amino acid sequences 
of the a2-tubulin of Zea mays (coded by gene II) and other a-tubulin are shown below respecting an optimal alignment. Only 
those amino acids that change from a 1 -tubulin of Zea mays are indicated. A dash in the sequence means a gap in the amino 
acid sequence introduced in order to maintain a good alignment. Sequences from Chlamydomonas reinhardtii a 1 -tubulin, 
Arabidopsis thaliana al and a3 tubulin, and Homo sapiens Kal-tubulin are those published in refs. [41], [29], [28] and [8], 

respectively. 



between ocl and a3 proteins of Arabidopsis is lower 
(91.2%) than between the two maize tubulins. 
These °data would suggest that the duplication 
observed in maize is relatively recent and, in any 
case, more recent than the duplication observed in 
Arabidopsis. On the other hand, we observe by 
Southern analysis a similar pattern of genomic 
structure in different varieties of Zea mays and 
related species (Fig. 5D), indicating that the 
existence of these two related genes may be a 
general characteristic of the group. 

Both deduced proteins al and a2 from maize 
present a Tyr as a C-terminal end. The same has 
been reported in all other plant or algal a-tubuhn 
sequences published showing that the phenome- 
non of absence of this ending amino acid of the 
variable 3' terminus may be restricted to the ani- 
mal kingdom [36]. There is also a high degree of 
conservation in the sequence postulated to be 
involved in GTP binding in the a-tubulin se- 
quences (amino acids 143-149) [44]. 

The introns and the flanking regions show a 
much larger divergence between the two genes 
than the coding sequence. The similarity between 
homologous introns of both gene I and II are 
88.2% in the second intron and 61.9% in the third 
one. A much lower similarity in the first intron is 
observed. It presents homology only at defined 
stretches. This intron seems to have evolved 
incorporating different sequences in gene II which 
are not present at all in the corresponding region 
of gene I. This first intron is unusually long 
(859 bp for gene I, 1724 bp for gene II) for plant 
introns reported until now [19]. Likewise, the 
location of introns differs from that observed in 
other species. Only the second intron is placed 
at the same amino acid position (110) as those 
observed in the two a-tubulin genes of Arabidopsis 
thaliana [28, 29], while the two other ones are dif- 
ferent in position and length. Although the posi- 
tion is the same in the second intron their lengths 
are quite different in both plants (512 bp for the 
al gene, and 76 bp for the a3 gene of A. thaliana; 
93 bp in both genes of Z. mays). Low homologies 
are observed when the second introns of both 
organisms are compared, but short segments (up 
to 30 nt) of the sequence appear to be extremely 



similar. The finding of the second intron con- 
served at the same place in both plants, A. thaliana 
and Z. mays, may be a result of the obvious 
evolutionary relationships between both organ- 
isms, while the homology observed when the cod- 
ing region is compared puts the described maize 
a-tubulin genes nearer to the corresponding ones 
from C. reinhardtii (91.7% on the average) than to 
the other ones from A. thaliana (88.2% on the 
average). These results may reflect the existence 
of two different rates of evolution, one related 
with" intron positions and the other one corre- 
sponding to the coding sequence itself. 

In the flanking regions homology is found 
between the two genes in the region surrounding 
the TATA boxes up to - 250 from the ATG. 
When more upstream regions are compared 
defined homologous segments can be observed in 
both genes separated by divergent sequences. An 
interesting sequence is found in the 5' end of the 
intergenic region. This is the only part of the 
8.5 kbp sequence where a high degree of repeti- 
tions is found. In fact, a complex structure of 
duplications may be observed at this point (see 
Fig. 4). The functional meaning of these dupli- 
cations either as a part of the promoter or as a 
hinge region in the gene duplication will be object 
of future experiments. 

The cDNA clone presented has been identified 
after a differential screening of a root cDNA 
library searching for clones corresponding to a 
mRNA preferentially expressed in the radicular 
tissues. The northern analysis presented indicate 
that the a-tubulin genes described here are ex- 
pressed in the meristematic parts of the plant, for 
example in the coleoptile or in the root tip, and 
therefore their expression may be correlated with 
cell division. In order to investigate this correla- 
tion we decided to use maize histone probes 
[6, 34] as a reference for division activity, since 
H3 and H4 genes are particularly expressed 
where DNA replication is present. In addition, it 
has been proposed that histone and tubulin genes 
have a cell-cycle regulation, giving the major 
peaks of accumulation simultaneously at the late 
G2 phase, as has been demonstrated in Physarum 
[5]. However, when the mRNA level of a-tubulin 
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^ con- is compared with that of histone genes a relatively would correspond to more than one gene it would 

aliana high expression is observed in the radicular imply an extremely high conservation of se- 

wious tissues, indicating that what is observed is not quences, even of those located downstream and 

>rgan- only the result of a concentration of dividing cells upstream of the coding regions of the a-tubulin 

e cod- (as could result from the differential screening genes. That would be unlikely in maize, specially 

maize approach) but also an organ-specific control when polymorphisms are present in most of the 

0ones superimposed on these one. The two genes show restriction sites used, located within the coding 

hanto similar patterns of expression with gene II mRNA region of these two highly conserved genes, 

in the accumulation being restricted to the meristematic Southern analysis also shows that other a-tubulin 

stence parts of the young plantlet. sequences, having a lower degree of homology, 

elated Tubulin genes expressed in defined tissues have exist in the genome of the plant. Our results and 

corre- been observed in animals and data are available analysis of different genomic clones obtained 

^ from human, mouse, rat, chicken and other verte- (data not shown) would indicate a small number 

found brate and invertebrate organisms [47]. In verte- of genes (probably three or four) coding for the 

inding brates, the variable acidic C-terminus amino acid whole a-tubulin family of Zea mays. 

ATG. positions of the protein would act as a specific The expression of both a-tubulin genes could 

lpared domain related to the particular expression of the be divided into two aspects which may interact at 

0sd in gene. No evidence in that direction has been different levels of regulation of gene expression. 

3s. An detected in plant tubulin sequences, probably due First, it is clear that a divisional control exists, 

of the to the lower number of sequences and situations since both histone (H3/H4) and a-tubulin genes 

of the reported. In plants little is known about this accumulate in the same meristematic organs such 

epeti- question. Most data come from A. thaliana where as root tips and coleoptiles. In addition, a second 

ire of a gene, the al-tubulin [29], is diferentially level of control appears indicating a preferential 

# (see expressed in flowers and through differents expression in radicular tissues when the signal of 

dupli- phases of flowering, whereas the a3-tubulin [28] histone H3 probe is taken as a reference, 

r as a is expressed contitutively in all organs examined. It would be very interesting to focus future 

object Also a ^-tubulin (the /?1 gene from A . thaliana) has experiments on analyzing the meaning of this kind 

been described presenting a preferential accumu- of organ-specific expression, in order to locate the 

«ified lation in the root system of the plant as we have al- and a2-tubulin proteins in particular sets of 

>NA shown for the al and a2 maize tubulin genes. In cells. In this way, genes I and II and their protein 

I to a addition, a possible organization in a tandem products may be useful markers of development 

licular array has been proposed for other /^-tubulin genes in the root system of maize, 

dicate of A. thaliana, but no sequence data are available 

re ex- up to now [33]. 

0, for In summary, a group of a-tubulin genes from Acknowledgements 
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Abscisic acid and water-stress induce the expression of 
a novel rice gene 



Claim 61 

ABA-inducible promoter 



John Mundy and Nam-Hai Chua 

Laboratory of Plant Molecular Biology. Rockefeller University, 1230 
York Avenue, New York, NY 10021, USA 

Communicated by D.von Wettstein 

We have identified a novel rice gene, called RAB 21, 
which is induced when plants are subject to water-stress. 
This gene encodes a basic, glycine-rich protein (mol. 
wt 16 529) which has a duplicated domain structure. 
Immunoblots probed with antibodies raised against 0- 
galactosidase/RAB 21 fusion protein detect RAB 21 pro- 
tein only in cytosolic cell fractions. RAB 21 mRNA and 
protein accumulate in rice embryos, leaves, roots and 
callus-derived suspension cells upon treatment with NaCl 
(200 mM) and/or the plant hormone abscisic arid (10 /iM 
ABA). The effects of NaCl and ABA are not cumulative, 
suggesting that these two inducers share a common 
response pathway. Induction of RAB 21 mRNA accumu- 
lation by ABA is rapid ( < 15 ruin in suspension cells) and 
does not require protein synthesis, indicating that pre- 
formed nuclear and/or cytosoBc factors mediate the re- 
sponse to this hormone. We have characterized the RAB 
21 gene by determining the complete nucleotide sequence 
of a nearly full-length cDNA and corresponding genomic 
copy, and by mapping the start site of its major tran- 
script. The proximal promoter region contains various 
GC-rich repeats. 

Key words: genomic sequence/cDNA sequence/cyclohex- 
imide/osmotic regulation/water deficit 



Introduction 

The hormone abscisic acid (ABA) mediates a number of 
important physiological processes in plants (King, 1976; 
Jones et al., 1987). Developmental studies have shown that 
ABA induces the accumulation of specific mRNAs and 
proteins late during embryogenesis in seeds of diverse species 
(Galau et al, 1986; Finklestein et al, 1985; Litts et al., 
1987). At this time, the level of endogenous ABA increases, 
the seeds desiccate and the embryos of some species become 
dormant (King, 1976; Suzuki etal., 1981). In cereal seeds, 
some of the Mate' developmental mRNAs are long-lived in 
mature, dry grains, but are rapidly degraded during seed 
germination. However, exogenousry applied ABA causes the 
precocious accumulation of these mRNAs in immature em- 
bryos and their reappearance in germinating seeds (Finkle- 
stein etal., 1985; Mundy etal, 1986). Little is known about 
the functions of these ABA-inducible proteins or about their 
intracellular localization. Some of them are storage poly- 
peptides (Finklestein et al. , 1985: Bray and Beachy, 1985), 



while others, such as lectins (Raikel and Wilkins, 1987) or 
an enzyme inhibitor (Mundy et al., 1986) may be involved 
in seed protection and/or the maintenance of dormancy. 

Physiological studies have shown that endogenous ABA 
levels increase in plant tissues subjected to water-stress by 
high osmoticum, NaCl, or drying (Henson, 1984; Jones et 
al., 1987). Under these conditions, specific mRNAs and 
proteins accumulate which could affect intra-cellular osmo- 
lality or have other protective functions (Finklestein and 
Crouch, 1986; Ramagopal, 1987). One such salt-inducible 
protein whose accumulation is increased by ABA has been 
isolated from tobacco and shown to be homologous to 
members of a group of proteinase inhibitors (Singh et aL, 
1987; Richardson et al., 1987). These results suggest that 
some of the ABA-inducible mRNAs and proteins which 
accumulate during seed desiccation are part of a general 
response by the plant to water deficit. If so, then part of 
this response involves the synthesis of enzyme inhibitors and 
lectins which may protect plant tissues from degradation by 
pathogens during periods of arrested growth and 
development. 

Both the developmental studies on seeds and the physio- 
logical studies on water-stress indicate that ABA controls 
the accumulation of specific mRNAs and proteins. However, 
it is unclear whether ABA acts at the transcriptional or post- 
transcriptional level, or both (Jacobsen and Beach, 1986; 
Mozer, 1980). The mode of action of the hormone via 
receptors and/or transducing pathways also remains obscure 
(Hornberg and Weiler, 1984) and it is not known whether 
transduction of the ABA response signals requires de novo 
protein synthesis. To date, no genomic sequences have been 
reported for plant genes strongly induced by ABA. Charac- 
terization of promoter sequences of such genes will provide 
a tool with which to dissect the mechanism by which ABA 
regulates specific gene expression. 

We are interested in determining how gene expression is 
regulated by plant hormones. We have chosen to study the 
effect of ABA on gene expression in rice because, as we 
demonstrate here, this hormone plays a central role in seed 
development and in the response of rice plants to water- 
stress, two important agronomic traits (Chang et al. , 1986; 
Seshu and Sorrells, 1986). Knowledge of the structure and 
function of ABA-responsive proteins will aid our under- 
standing of the physiology of seed maturation and of drought 
tolerance in cereals. As a first step, we have isolated several 
cDNA clones whose expression is induced by ABA and 
water-stress. One of these clones, called RAB 21 (for 
Responsive to ABA), was fully characterized. We present 
here the sequence of this novel rice gene, and a characteriz- 
ation of the RAB 21 protein product. We show that the 
induction of the RAB 21 by ABA and water-stress is rapid 
and independent of de novo protein synthesis. 
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Results 

Isolation of an ABAnnduoble cDNA encoding a 21-kd 
polypeptide 

As a first step toward isolating genes from rice whose 
expression is affected by ABA, mRNA populations from 
developing rice seeds were analyzed by in vitro translation 
and SDS-PAGE of the protein products. This experiment 
identified mRNAs encoding prominent polypeptides of mol 
wts 45, 39, 30, 25, 23, 21 and 20 kd which accumulate late 
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during nee embryogenesis (Figure 1A, lanes 1 -5, see dots). 
These mRNAs are long-lived in mature grain harvested 30 
days after flowering (DAF) (Figure 1A, lane 5) and 
disappear completely during normal germination (Figure 1 A, 
lane 6). However.their accumulation can be recapitulated 
Airing germination by a 12-h incubation with ABA (Figure 
1 A, lane 7). These results indicate that the levels of mRNA 
encoding these polypeptides are modulated by ABA. 

Two-dimensional separation of these translation products 
revealed that the 23-, 21- and 20-kd polypeptides are com- 



ABA and water-stress responsive rice gene 
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prised of multiple isoelectric forms (Figure 1C, see arrows). 
The even spacing of spots suggests that some of this hetero- 
geneity may be due to serial charge differences, a common 
eiectrophoretic artefact. Alternatively, some of the different 
isoforms may be products of different genes (see Discussion). 
Differential labelling experiments suggest that the 23-, 21- 
and 20-kd polypeptides are rich in methionine but lacking 
in cysteine (Figure 1A, lanes 7 and 8). 

A cDNA library was constructed using the ABA-treated 
seed mRNA as template. ABA-rcsponsive cDNA clones 
were isolated by differential screening and subsequent North- 
ern blot analysis. One clone, called pRAB 21, was chosen 
for further characterization. Figure ID shows that pRAB 



21 hybridizes to mRNA(s) that encode the prominent poly- 
peptide^) of apparent mol. wt 21 000. At the low hybridiz- 
ation stringency shown here, mRNAs encoding the 23- and 
20-kd polypeptides are also hybrid-selected (the 23-kd group 
is very faint in this exposure). At higher stringencies of 
hybrid-selection, only the 21-kd polypeptides are seen (not 
shown). This cross-hybridization, together with several lines 
of evidence discussed later, indicate that the mRNAs for the 
23-, 21- and 20-kd polypeptides are homologous. 

RAB 21 is a basic, gtycine-fich protein that 

accumulates in the cytosd of ABA-treated ceils 

The amino acid sequence of RAB 21 was determined from 
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ofhydropotucaJly grown rice following addition of 10 uM ABA to 

25^ 'TJ "5 ^ » *oots of «he same plants following 
™°"jf I0 J^. ABA to media (tones 5-8), addition of N fci 
MS (lanes '-»)• and air-drying me whole plants 
k il °" ng a^wnplished by removing plants from 
growui ^media and air-drying on a laboratory bench at 27'C for the 
indicated times. Arrows mark the single major transcript of 850 bases 



the nucleotide sequence of the RAB 21 cDNA and genomic 
'I 0 "" (** F 'g"re 5). The RAB 21 open reading frame 
^J^ 10 93 " TGA 665 > encodes a polypeptide of mol. 
wt 16 529 (Figure 2A), roughly 4500 smaller than the mol 
wt predicted from the mobility of the RAB 21 protein on 
SDS-PAGE. However, several lines of evidence suggest 
that this is the correct sequence of the RAB 21 protein. First, 
the ORF encodes eight methionines and no cysteines, con- 
sistent with the results of in vitro translation experiments with 
labelled cysteine or methionine (Figure 1A). Second, the 
ORF-encoded polypeptide is basic (approximate pi = 9 4) 

Sd^SRI?^* m6bihty of me *** 21 P°lypeptide(sj in 
NEpHGE (Figure ID). Third, the GC content of the ORF 
is high (70%) while that of the 5'- and 3 '-untranslated 
regions is low (45%). This GC-codon bias has been noted 
in the coding regions of other seed protein genes (Rogers 
etal., 1985). To obtain direct confirmatory evidence, anti- 
bodies were raised against portions of the RAB 21 protein. 
Figure 2B and C show that antibodies against a /3-galacto- 
sidase/RAB 21 fusion protein (RAB 21 residues 15-163) 
specifically recognize in vitro and in vivo synthesized poly- 
peptides which correspond to RAB 21 . Furthermore, fusion 
proteins with the RAB 21 cDNA in all three reading frames 
were produced in Escherichia coli and analyzed on Western 
blots with polyclonal antibodies raised against an extract of 
total soluble proteins from mature rice seed. Only the 'in 
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S'tI^ 5 " 5 ^ *** 21 in *"F*™°n cells. 

(A . course of accumulation of RAB 21 mRNA in suspension 

of *T (B) efta of (""incubation with 
mnibuors of protein synthesis. RAB 21 mRNA levels after 6 h 
moibauons without tone 1) and with 10 pM ABA (lane 2). Lanes 3 

^ihi^f iK ? ba « ions fo »<^"8 a 1.5-h pre-incubation with 

inhibttors of protein synthesis (cyclohewnide (10 /iM), anisomycin 
. chloramphenicol (100 iM)}. The arrows mark the two 
different transcnpts which accumulate. The size of the smaller 
transcript is 830 bases. 



frame" fusion was immunoreactive (results not shown) 
demonstrating conclusively that the other two possible 
reading frames are not translated in vivo. 

In vitro transcription/translation of RAB 21 cDNA con- 
structs was used to show that the predicted stop codon (TGA, 
665) terminates the RAB 21 ORF. In these experiments, 
Ptzl9U vectors (USBC) containing either the Nhel-Sacl 
v 5 / -670) fragment which terminates at the expected TGA 
or the Nhel-Nhel (57-741) fragment, were used as tem- 
plates to produce polypeptides in vitro. These two polypep- 
tides had identical mobilities in SDS-PAGE (not showhh 
coirfmung that TGA at residue 665 is used as the stop codon. 

Analysis of the RAB 2 1 protein sequence using Harr plots 
revealed that the protein contains a duplicated domain struc- 
ture. Each domain contains an A and B repeat (Figure 2A) 
The A and B repeats are adjacent in the C-terminal domain, 
k ^i eparated "> "e N-terminal domain by a portion of 
the ORF containing the gene's single intron. An algorithm 
which predicts the secondary structure of RAB 21 suggests 
that most of the polypeptide, including the A repeats, is 
composed of random turns due to the irregular spacing of 
glycine residues. The B-repeats are very basic and contain 
the conserved sequence KKEKLPG which may be part of 
a neb* bounded by turns. Further computer analysis showed 
weak homology between the entire protein (or the A repeats 
alone) to regions of several viral nuclear proteins. The 
highest homology occurred with the glycine -alanine co- 
polymer domain of the major nuclear antigen encoded by 
the Epstein -Barr vims (Hennessy and Kieff, 1983). Not 
surprisingly, the very basic, lysine-rich B-repeats are weakly 
homologous to various DNA-binding proteins such as his- 
tones These results suggest that the RAB 21 polypeptide 
may be a nuclear protein. To examine this possibility, RAB 
21 antibodies were used to probe soluble, organellas and 
nuclear fractions prepared from leaves of ABA-treated rice 
plants. These experiments show that the RAB 21 protein is 
found in the soluble fraction (Figure 2C). Since the protein 
has not been detected in crude organellas chloroplastic, 
mitochondrial or nuclear extracts, we suggest that RAB 21 
is a cytosolic protein. However, definitive proof of its cellular 
location remains to be established by immuno-electron 
microscopy. 
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ACACCATCACCCTTCTCACCACCCTCATCTACCACCCTCCTTZACCACTCCCTCACTCCCACCACCCTCTCCCCCCCCCTCCAATACAACAACAAACATC 
CAACCCTCATATAACCTAACACCCACTACCCATTTAACCTATCATTCCCACAATCTAATCCACTTATTTCTCTTCCCATCATCTTATCCTCTCATTTCTC 

ctcactacttttccatttctaccaaacacaatcacaccctc;aacaaacctcctccaccaccctacccaccaatcaccaaaacacacaccccaccacctc 
cccaccccccatccccacccccatcacacaaccccacccaaacacccaccaccaccttccccatcatcctcctcttcccccaccacctcactccccatcc 
caatcacacccccccaccccaacacaaaaaccccccaccatcccccccaccaccctactaccatcaaccaacatcatctcaaacccaccaccccaccacc 

CTTCCACCTCTCCCCCCCACCTCAACACCCTCATCTCTACAACCACATCTTACACCACATCCCCACCCAACTTACCCCTCCAACTCACCCCACTCCATAT 
CCCTCCTTCCCCTCCTCCTCATCTACACAATCACCTTCCTTATCTCTCCATACTTCTCCTTTCTTTCCCCTCCCCCAAATACAACAACACACACCTCCCT 
TTTCTTCCACACTTTTACTCCTATTCTACCCCTATTTCTAATTTTCTTCTACTTTATTCTATTAATCAATAAACCTCTTTCATTCTATTTTCACTCAATC 
TTCAATCCATTCATCTCTTCCTCTTCCACTCACTATCTTACAATATTCATTCCCTTCAAACAATCTTCCTTAACCCTTCCAACATTTTTATCTCTTCCCT 
CAAACATCCCTAATATTTTCCTTCAAACAATTTTTATCCCaCaCCACCCTCCAACAATTTACACCAATTTCCACCTCTCATACATACCACTCCCCAACTC 
AAACTCACCACCACTTCAAACCTATACAAACTCAACTTATTCATCTAAAACACCCCACACATCCCCCCTCCCCCTCCCTCCCA/.ACCACACCCTTCACCC 
CCATCACCCATTTATTTTTTAAAAAAATATTTCAACAAAAAACACAACCCATAAAATCCATCCAAAAAAAAAAACTTTCCTACCCATCCTCTCCTATCTC 
CATCCACCCCCACCACTCATCCAAACCCTCCATCCACCCCCACACTACACACACATACTTATCCTCTCTCCCCCCCATCACTCACCACCCCTCTCTTCCA 
CAAACCCCTCCCCCCACACCCTACCTCCCCCACCCCCCCCCrrCCCCCCTCCACACCTCCCCCTCCTCTCCCCCCCCCCTCCCCACCCTCCACCCCCTCC 
CCCACACCTCTCCCTCTCTCCCTCCACCCATCCCCTCC£AAi:CACCTCATCTCCTCCCCTCCTCCCCCTIAI^A.ATCCCCCCCACCACCTTCACCTCCT 

TCCACACCACACCAACACCTAACTCACCTACCCACTCATCACAACAACACCTCCATCTCTCACACTCTTTTTTCACCTTTACCTTAACCACCAJfiCACCA 

CCACCCCCACCACCCCCACCTCACCACCCCCCTCCACCACTACCCCAACCCCCTCCCCACCCCCCCCCCACACCCCCACATCCCCACCCCCCCCATCCCC 

ACCCACCCCACCCCCCCCACCCCCCCCCACTTCCACCCCATCACCCACCACCACAACACCCCCCCCCTCCTCCAACCCTCCCCCACCTCCACCTCAACCT 

CCgt«c»«c«CCttg«ccccc««ttccctACCCcec«ct««*«cetCgcgt*c*«ttcstCg«*««tttt«Atgtcttjt j»e«gTCTCACCATCATCCA 

ATCCCACCCACCACCAACAACCCCATCAACCACAACATCAACCACAACCTCCCCCCCCCCAACAACCCCCACCACCACCATCCCATCCCCCCCACCCCCA 

CCCCCACCGCCACCCCCACCCCAACCCCCCCCCCCTACCCCCACCACCCCCACCCCACCCCCATGACCACCCCCACCACCCCCCCACACCCCACCACCAC 

CACCCACACCCCCCACAACAACCCCATCATCCACAACATCAACCACAACCTCCCCCCCCACCAClfiACCTCCACACACCACCACACCATCTCTCCCTCCC 

CCCCACCCCCCCCACCTCACCTTCCTCAATAATAACATCACCTACCCCACCCCAATAAAACCAAAAAAAAATCTTACTGTCCTCTCATCACTCTCAATCT 

CTCATCCCCTTCTCCACTACCCACCCTCTCTCCCTCTTCTTTCTACTCTCAAACTACCCTCTCTATCTACCTCCTATCTCTACAATTCCCCTTATACTCT 
• poly A* .« poly A*. 

TCTTCAAATACCTATAAATATCTCTACATTATACCCTCTATATCTCATCCTCCATATCTACACAACCTTTTCCTCATCCTTATAATCTTCATTTTTTTCC 
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Fig. 5. Structure of the RAB 21 cDNA and gene. (A) Restriction map of the 4.3-kb BamHl genomic fragment containing the RAB 21 gene. The 
map was generated by multiple enzyme restriction analysis. The two Sacl sites are at -52, downstream of CAAT, and at +670, downstream of the 
TGA terminating the ORF (see Figure 2B). (B) Nucleotide sequence of the RAB 21 gene. The 2.4-kb Pstl-Xhol genomic fragment (see A) and the 
cDNA were sequenced as described in Materials and methods. The ORF is in bold-faced type, the single intron is in lower case. Putative 
transcription and the start and stop codons of the ORF are underlined. Polyadenylation sites found in the cDNAs are marked with an asterisk. 
Various GC-rich repeats in the proximal promoter region, including five sequences which resemble the consensus binding sites of the SP1 Mumal 
transcription factor (Briggs et a/., 1986) are discussed in Results. (Q Primer extension using a synthetic oligonucleotide (+84 to +J^;7**\ end ~ 
labelled oligo was used as a primer to reverse transcribe without RNA (panel 1) and with 1 jig of poly(A) + RNA from ABA-treated l™*^ 6 * 
without (panel 2) and with subsequent incubation with SI nuclease (panel 3). Panel 4 shows the 5' sequence (GATC top to bottom) of the pRAB 21 
cDNA obtained with the same oligo as primer. 
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Accumulation of RAB 21 mRNA is induced in 
different tissues by ABA and water-stress 
Northern Wot analysis was used to analyze the steady-state 
levels of RAB 21 mRNA in various rice tissues during 
different treatments. Experiments with total RNA from 
developing seed tissues (Figure 3A) confirmed the patterns 
Z. accumulation as assayed by in vitro 

^nsdaoon (Figure 1 A). During seed development, low levels 
ot RAB 21 mRNA are found in both endosperm and embryo 
seed halves (Figure 3A, lanes 1-4). High levels of this 
mRNAaccumulate in embryos between 20 DAF and ma- 
turity (30 DAF) and survive as long-lived mRNAs in the 
resting grain (Figure 3A, lane 5). Homologous signals were 
also seen among long-lived RNAs extracted from mature 
seeds of maize, barley, wheat and millet (not shown), sug- 
gesting that proteins homologous to RAB 2 1 are functionally 
conserved among the Graminae. As was shown for barley 
(Mundy «r ol., 1986), mature rice endosperm does not 
contain detectable mRNA levels. Therefore RAB 21 mRNA 
was not assayed in this tissue. 

The long-lived RAB 21 mRNA in rice embryos is rapidly 
turned over at the onset of germination (Figure 3A. lane 6) 
However, accumulation of this mRNA can be recapitulated 
by incubating the germinating embryos in 10 fiM ABA 
(Figure 3A lane 7). This indicates that ABA affects the 
steady-sate level of RAB 21 mRNA. 

K^i 1 u S T presenl at ^ectable levels in the roots and 
shoots of hydroponically-grown, 1-month-old rice plants 
(Figure 3B, lanes 1 and 5). However, upon addition of ABA 
to Ae solution bathing the roots, the mRNA accumulates 
in both organs and reaches a steady-state level after 12 h 
(Figure 3B, lanes 2-4, 6-8). At this time the level of RAB 
21 mRNA is induced at least 20-fold over the control 
Therefore the increase of the RAB 21 mRNA levels in the 
presence of ABA is not restricted to seed tissues. 

To determine if the expression of RAB 21 could also be 
induced by water-stress, NaCl was added to the hydroponic 
solution to a final concentration of 200 mM. Figure 3B (lanes 
v U) indicates that RAB 21 mRNA levels increase in 
response to prolonged growth in solutions containing elevated 
salt concentrations. The time course of induction in response 
to NaCI is similar to that obtained with ABA; the maximal 

'I f U ?, Ulation (2 °" to 3 °- fold induction) being at- 
tained after 12-24 h. The pattern of RAB 21 mRNA 

3 ^ Was ""d^ged when both ABA and salt were 
a f ded to ™ hydroponic solution. RAB 21 mRNA levels are 
f^l^t y des , iccation of nce P'ants (Figure 3B, lanes 

of p a r ^T 6 ,^* Stn>ngly ^ accumulation 

or RAB 21 mRNA is regulated by the water status of the 

Detectable RAB 21 mRNA accumulation in whole rice 
plants generally requires 3-4 h of ABA treatment. This long 
response time may reflect a low rate of exogenous hormone 
uptake by roots and translocation to the leaves. To examine 
this possibihty, RAB 21 mRNA accumulation was measured 
ui cultured suspension cells derived from embryogenic calli 
In contrast to whole plants, the increased RAB 21 mRNA 
levels are easily detected in suspension culture cells after 
only 15 mm of ABA treatment (Figure 4, lanes 1 and ? 
During prolonged incubation with ABA, RAB 21 mRNA 
levels increase steadily to a maximum of -20 tin^he 
1C% l at 3 ~ 6 h ( F ^re 4, lanes 3-6). 
The rapidny of the ABA response in cell cultures suggests 



that protein synthesis is not required for RAB 21 gene 
expression. To obtain direct evidence on this point, ceffis 
were treated with a combination of protein synthesis inhibi- 
ors affecting 70S and 80S ribosomes which reduced total 
proteu, synthesis by > 90%. The protein synthesis inhibitors 
ne.tr.er induce the expression of RAB 21 (Figure 4, lanes 
1 and 3) nor block its induction by ABA (lanes 2 and 4, 

VSS TSZ (,0 2" 1200 bases) ^ h y bridizes » 

fte RAB 21 cDNA probe accumulates in the presence of 
Ae inhibitors. These results have also been seen in leaf 
assues treated with ABA plus inhibitors (results not shown,. 
This longer transcript has not yet been characterized but a 
may correspond to the primary transcript of a homologous 
gene which contains an intron of 300-400 bp. 

Structure of RAB 21 gene 

The nucleotide sequence of the pRAB 21 cDNA was used 
to deduce the primary structure of the encoded polypeptide 
The clone was also used to isolate a corresponding genomic 
done gRAB 211. Analysis of sequences upstrelm of the 
kab 21 gene s coding region identified putative regulatory 
events The restriction map and nucleotide sequence of 
toe cDNA and the corresponding region of the genomic clone 
U.5-kb restriction fragment) are presented in Figure 5A and 
7m JL^™ 0RF ° f the transcribed region is 489 bp 
(V3 -665) encoding the 162 amino acids of RAB 21 (Figure 
2A). Flanking the ORF is a 92-bp 5' leader containing three 
stop codons and a 261-bp 3' tail containing the putative 
polyadenylation sequence ATAAA 12 bp upstream of the 
sue of poly(A) addition. The 3'-non<oding regions of six 
homologous cDNAs were sequenced. Five of them were 
identical to pRAB 21, being polyadenylated at position 
+vzo. This correlates well with the size of the RAB 21 
mRNA (800 nucleotides). The sequence of the sixth clone 
was identical to that of the genomic DNA but contained a 
pory(A) tail farther downstream at position +991 The 
ongmal 824-b P cDNA was shown by primer extension to 
tok only 18 nucleotides from the 5'-untranslated leader 
(Hgure 5C). The transcription start site is shown in Figure 
2B (nucleotide +1). 6 

In the genomic DNA, the ORF is interrupted by a 83-bp 
AT-nch intron flanked by the consensus border sequences 
,kI nKu n .^ sequence of gRAB 2 1 1 was identical to 
the cDNAl RAB 21, indicating that this gene is transcribed 
in vivo. The proximal GC-rich promoter (-200 to -1) 

^T 5 /,^ 6 TATAA ^ (- 30 ) ^ a putative CAAT 
V^T i regl0n contains the following four types 
of GC-nch repeats (see Figure 5B): (1) TGCGCCACCG 
n~lJl~ d ~ 12 1;(2) CGCCGCGC at - 167 and - 129- 

This region also contains five other GC-rich repeats at 
-200, - 195, - 166, -133 and -48 whose opposite strand 
homo, °gy to *e decanucleotide (G/ 
TG/AGGCGG/TG/AG/AC/T) binding site of the SP1 tran- 
scription factor (Briggs et ol., 1986). 

Discussion 
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We are interested in studying the molecular mechanism of 
action of plant hormones on gene expression. To begin this 
work we isolated a cDNA encoding a major transcript that 
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is inducible in rice tissues by the plant hormone ABA. This 
cDNA was shown by hybridization and immunoassay to en- 
code a prominent member of a group of basic polypeptides 
of mol. wt 23-20 kd. We call this protein RAB 21. The 
different RAB polypeptides may be post-translational modifi- 
cations of a single gene product or products of closely related 
genes. The rice genome contains at least three closely-linked 
(within 30 bp) genes homologous to RAB 21 (K.Yamaguchi- 
Shinozaki, unpublished results). These genes encode pro- 
teins of slightly different amino acid sequence that may ac- 
count for the different groups of polypeptides 
immunoreactive to RAB 21 antibodies. 

Northern blot hybridizations show that RAB 21 gene 
expression is not tissue specific, as shown by the accumu- 
lation of its transcript in seeds, rows, leaves and in un- 
differentiated suspension cells. Since there at least three other 
rice genes closely related to RAB 21, gene-specific probes 
are needed to ascertain whether these genes are differentially 
expressed. Physiological experiments show that RAB 21 
mRNA accumulates not only in ABA-treated tissues but also 
in leaves, roots and suspension cells under conditions of 
water deficit. These results suggest that RAB 21 gene ex- 
pression is dependent upon the water status of plants and 
that ABA may act as a signal in this response. The pathway 
of this response is different to that mediating the more general 
heat shock response (Heikkila et ai. 9 1984), because RAB 
21 mRNA is undetectable in rice tissues after heat shock 
for various periods of time (not shown). 

Experiments with cultured cells show that accumulation 
of RAB 21 mRNA following ABA treatment is very rapid 
and that it is insensitive to inhibitors of protein synthesis. 
These data strongly suggest that ABA-induced gene ex- 
pression does not require protein synthesis but probably 
involves modification of pre-existing factors, as is the case 
for the heat shock response (Zimarino and Wu, 1987). NaCl 
also induces RAB 21 mRNA rapidly in cultured cells (not 
shown). This response is not additive to that attributed to 
ABA: addition of NaCl and hormone together does not 
"superinduce 9 mRNA accumulation at any point in the time 
course. These results corroborate the findings of physio- 
logical (Jones et al., 1986) and genetic (Chandler et a/., 
1988) studies which show that the response of plants to 
water-stress is mediated by ABA at the level of specific gene 
expression. 

To initiate studies on the molecular mechanism of ABA 
action, the gene encoding RAB 21 was isolated and its 
nucleotide sequence determined. This is the first published 
genomic sequence of a strongly ABA-responsive gene. The 
proximal promoter region is GC-rich and contains numerous 
repeats detailed in Results. Another group of repeats is 
closely related to to the GC element found in various cellular 
and viral genes in mammalian cells. This m-acting element 
promotes the expression of genes by binding the trans-actmg 
protein factor SP1 (Briggs et al., 1986). Similar sequences 
have been noted in the promoter of the or-subunit of 0- 
conglycinin (Chen et ai., 1986) and in that of oat phyto- 
chrome (Hershey et al.\ 1987), plant genes which are not 
known to be responsive to ABA treatment. The regulatory 
roles of these different GC-rich repeats remain to be estab- 
lished by functional assays. These experiments, now in 
progress in our laboratory, may elucidate the molecular 
mechanism by which ABA regulates gene expression and 
mediates the adaptation of plants to water-stress. 



Materials and methods 

Plant materials 

Seeds of rice (Oryzo saliva, var. Indica, cv. IR 36) were obtained from 
the International Rice Research Institute, Philippines. Plants for developmen- 
tal studies were grown in soil at 27°C and a day length of 11 h. Plants 
flowered 10- 1 1 weeks after planting. Seeds were then collected after 10 
days (milk stage), 20 days (starchy, green pericarp) and 30 days (dry, brown 
pericarp, mature). Plants were also grown hydroponkally in Hoagland's 
solution supplemented with 20 mM NH 4 N0 3 . Preparation and treatment 
of embryo-containing half-seeds at 27 °C was performed as described 
previously for barley (Mundy et ai., 1986). Suspension cells derived from 
embryonic callus of IR 36 were grown at 25 °C in ftao's medium (Kao, 
1977) containing 2.5 mg/1 2-4 D and 0.2 mg/1 kinetin with subculturing 
at 1-week intervals. Cells for mRNA isolation were subcultured 5-7 days 
prior to harvest. 

Protein analysis 

Proteins were analyzed by NEpHGE, SDS-PAGE and Western blots 
according to Tingey et ai. (1987). Isolation of mRNA, in vitro translation 
using reticulocyte lysate and irnmunoprecipitation with Protein A Sepharose 
4B were as described previously (Mundy et ai., 1986). Leaf cell fractions 
were prepared from 7 -day -old plantfcts sprayed three times with 100 /iM 
aqueous ABA solutions during the 24 h prior to harvest. Chloroplasts and 
mitochondria were prepared from leaf tissue after Boutry and Chua (1985) 
while nuclei were prepared after Green et ai. (1987). The National Bio- 
medical Research Foundation Protein Sequence Databank carried in the 
Rockefeller University 7000/40 computer was screened for sequences related 
to the RAB 21 ORF with the SEARCH program of Dayhoffrt ai. (1983). 
Detailed comparisons of protein sequences thought to be related to the RAB 
21 sequence were made with ALIGN, utilizing the mutation data matrix 
[250 PAMs = 6, and a gap penalty of 8 (DayhofT et ai., 1983)]. 

Isolation of cDNA and genomic donas 

Double-stranded cDNA synthesized by the RNase H method (Gubler and 
Hoffman, 1983) was size-fractionated on a column of Bio-gel A50m (Bio- 
Rad Laboratories). Molecules of 450 -4500 bp were then doned by homo- 
polymer GC-tailing into pEMBL 12 plasmid (Dente et ai., 1983). Six 
thousand recombinant clones, replica-transferred onto nitrocellulose filters 
from 96- well microtiter plates, were screened with single-stranded cDNA 
probes synthesized from mRNA isolated from control and ABA-treated half- 
seed mRNAs. Hybridizations were performed in 50% formamide, 6 x SSC, 
1 x Denhardt's, 0. 1 % SDS, 100 /ig/ml denatured salmon sperm DNA at 
42°C with denatured 32 P-labelled DNA probes (sp. act 1 x 10 s c.p.m./ 
/ung DNA, final concentration 1 x 10* c.p.m. /ml). Sixty ABA-responsive 
clones were then tested on Northern blots of the same RNAs. Hybridiz- 
ation selection after Tingey et ai. (1987) was used to identify polypeptides 
encoded by specific clones. Standard protocols and conditions were used 
for agarose electrophoresis, hybridization in 50% formamide, DNA fragment 
isolation with DE-81 paper, and plasmid DNA preparation (Maniatis etal. t 
1982). 

Genomic DNA was isolated from 10-day-old etiolated leaves by CsCl 
centrifugation (Maniatis et ai., 1982). Southern hybridization of restricted 
DNA revealed a 4.3-kb Xbal fragment which hybridized strongly to the 
selected RAB 21 cDNA. This genomic fragment was partially purified by 
size fractionation in agarose gels and then cloned by insertion into Xbal- 
digested lambda ZAP (Stratagene). A total of 75 000 recombinant plaques 
were screened on duplicate filters and three clones containing identical 4.3-kb 
Xbal inserts were identified by hybridization to pRAB 21. Excision and 
recircularization of pBLUESCRIPT SK(ml3) plasmid following super- 
infection of lambda ZAP infected cells with IR 408 helper phage (Russel 
et ai., 1986) was performed according to the manufacturer's instructions 
(Stratagene). 

DNA sequencing and primer extension 

The pRAB 21 cDNA insert and a 2.5-kb Pstl-Xhol genomic fragment 
taming the RAB 2 1 sequences were sequenced from overlapping deletions 
created by BoBX exonuclease (Misra, 1985). Single-stranded templates were 
prepared by superinfecting pEMBL containing recombinants with the IR 
408 helper phage (Russel et a/., 1986). Sequencing reactions were performed 
according to Biggin et oi. (1983) and the products separated on 6% poly- 
acrylamide/urea gels. Inosine was used to resolve GC -compressions. More 
than 90% of the sequence was obtained for both strands of DNA. At least 
two overlapping clones were used when only one strand was sequenced. 

A 24-base oligonucleotide corresponding to 5' sequences of pRAB 21 
was synthesized on an Applied Biosystems model 380A DNA synthesizer 
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after the manufacturer s instructions The eel-nurifi^ „r 
ft«fan iproftrih anrf antibody production 

0-galacwsidase. Fusion pLeL Lnf^ 0 ^,? ° f 
galactosidase-SeDharose^™ !l? from <*'' extracts on anti-fl- 

tions (PromegaliS were *° "* ™"><**»^ 

6 iwccnj were used for immunization of rabbits. 

UNA bbt analysis 
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Subdomains of the Octopine Synthase Upstream 
Activating Element Direct Cell-Specific Expression in 
Transgenic Tobacco Plants 

Halina Kononowicz, 1 Y. Eureka Wang, Lillian L. Habeck, and Stanton B. Gelvln 2 

Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907 



Previous work has shown that the octopine synthase (ocs) gene encoded by the Agrobacterlum tumefaciens Tl-plasmld 
contains an upstream activating sequence necessary for Its expression In plant cells. This sequence Is composed of 
an essential 16-bp palindrome and flanking sequences that modulate the level of expression of the ocs promoter in trans- 
genic tobacco calll. In this study, we have used RNA gel blot analysis of RNA extracted from transgenic tobacco plants 
to show that the octopine synthase gene Is not constltutively expressed In all plant tissues and organs. This tissue- 
specific pattern of expression Is determined, to a large extent, by the 16-bp palindrome. Histochemlcal analysis, using 
an ocslacZ fusion gene, has Indicated that the 16-bp palindrome directs the expression of the ocs promoter in specific 
cell types In the leaves, stems, and roots of transgenic tobacco plants. This expression Is especially strong In the vascu- 
lar tissue of the leaves, leaf mesophyll cells, leaf and stem guard ceils, and the mertstematlc regions of the shoots and 
roots. Sequences surrounding the palindrome in the upstream activating sequence restrict the expression of the ocs 
promoter to fewer cell types, resulting In a reduced level of expression of p-galactoskfase activity In the central vascular 
tissue of leaves, certain types of leaf trichomas, and the leaf primordia. 



INTRODUCTION 



Many eukaryotic genes contain upstream activating se- 
quences or enhancer elements that are essential for gene 
transcription. These sequences are not promoters in them- 
selves but activate transcription from linked promoters. They 
are independent of their orientation relative to the promoters 
and can be either independent (enhancers) or dependent 
(upstream activating sequences) upon their position relative 
to these promoters. Enhancers can be tissue specific or in- 
ducible. They may also be modular, in that they may be com- 
posed of a limited number of basic sequence motifs that inter- 
act in a synergistic fashion (for a review, see Wasylyk, 1988). 

Upstream activating sequences have been identified for a 
number of plant genes. Among the best studied of these acti- 
vators is that of the cauliflower mosaic virus (CaMV) 35S pro- 
moter. The CaMV 35S activator has multiple domains that 
can function either independently or synergistically to acti- 
vate the 35S promoter in a developmental^ and tissue-spe- 
cific manner (Benfey et al., 1989, 1990a, 1990b). Upstream 
activating sequences have also been identified among genes 
of the Agrobacterium tumefaciens Ti-plasmid encoded by 
T-DNA. T-DNA genes containing such elements include those 
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encoding enzymes involved in cytokinin biosynthesis (tmr, 
de Pater et al., 1987), agropine biosynthesis (ags, 
Bandyopadhyay et al., 1989), nopaline production (nos, An, 
1987; Mitra and An, 1989), mannopine biosynthesis (mas, 
DiRita and Gelvin, 1987; Langridge et al., 1989; Letsner and 
Gelvin, 1989; Comai et al. t 1990), octopine production (ocs, 
Ellis et al., 1987a, 1987b; Letsner and Getvin, 1988, 1989; 
Bouchez et al.. 1989; Fromm et al., 1989; Singh et al., 1989), 
and a gene of unknown function encoding a 780-base mRNA 
(Bruce et al., 1988). 

Recently, we (Leisner and Gelvin, 1988, 1989) and others 
(Ellis et al., 1987b) have shown that the octopine synthase 
(ocs) upstream activating sequence contains a 16-bp palin- 
drome essential for ocs activator function. Specific proteins 
can bind to this palindrome (Bouchez et al., 1989; Fromm et 
al., 1989; Singh et al., 1989, 1990; Tokuhisa et al., 1990) and 
presumably are involved in the activation of the ocs promoter. 
In transgenic tobacco call!, however, sequences surrounding 
this palindrome are necessary for maximal stimulation of ocs 
transcription. These additional sequences include an ele- 
ment with the potential to form a Z-DNA structure (Leisner 
and Gelvin, 1989). In this study, we have examined the in- 
fluence of various DNA sequences from the ocs transcrip- 
tional activating element upon ocs promoter expression in 
transgenic tobacco plants. We show that the complete ocs 
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activator confers tissue-specific expression upon the ocs 
promoter. Moreover, DNA sequences surrounding the 16-bp 
palindrome within the ocs activator limit the expression of an 
octopine synthase-p-galactosidase fusion protein to particu- 
lar cell types. 



RESULTS 



RNA Get Blot Analysis of ocs RNA from 
Plant Tissues and Organs 

In previous studies (Leisner and Gelvin, 1988, 1989), we in- 
vestigated the importance of various sequences upstream of 
the octopine synthase gene in the activation of the ocs pro- 
moter. These experiments utilized calli derived from trans- 
genic tobacco plants transformed with constructions contain- 
ing the ocs promoter and structural gene and various 
fragments of the ocs upstream activating sequence. In 
tobacco calli, octopine synthase activity directed by these 
constructions correlated with the steady-state level of ocs 
mRNA (Leisner and Gelvin, 1988). We wished to extend 
these studies and investigate the tissue-specific and de- 
velopmental utilization of these ocs activator sequences in 
mature tobacco plants. We, therefore, regenerated plants 
from these transgenic tobacco calli and assayed octopine 
synthase activity in various tissues and organs of these 
plants. Control experiments indicated, however, that certain 
plant tissues contained substances that inhibited the octo- 
pine synthase assay (data not shown). 

Because octopine synthase activity was not a reliable indi- 
cator of ocs gene activity in mature transgenic tobacco 
plants, we performed an RNA gel blot analysis of RNA de- 
rived from these plants to examine tissue-specific patterns of 
ocs mRNA accumulation. RNA was extracted from leaves 
(five leaves equally distributed along the stem), stems (three 
segments), flowers, the root tip, and the root base. Total cellu- 
lar RNA (20 \ig) was fractionated by formaldehyde-agarose 
gel electrophoresis, blotted onto nitrocellulose, and hybrid- 
ized with a DNA fragment containing the ocs gene. Figure 1A 
shows the regions of the ocs transcriptional activating se- 
quence that were present in the different plants, and Figure 
1B shows the different parts of the plant that were examined 
for ocs mRNA. A total of 15 plants harboring the different con- 
structions were analyzed to determine the amount of ocs 
RNA. Figure 1C shows RNA gel blot analyses of RNA derived 
from plants containing the constructions pENI, pAlu106, and 
pPAL16. There was, in general, a relatively high steady-state 
level of ocs mRNA in the leaves and the root tip, a lower level 
of ocs mRNA in the stem (especially the lower stem sections) 
and flowers, and a very low level of ocs mRNA in the basal 
sections of the root. Plants containing the constructions 
pENRI, pAITaR54, and pAlu45 showed similar patterns of ocs 
mRNA accumulation (data not shown). Hybridization of the 



blots with an rDNA probe indicated that the lanes were 
equally loaded (data not shown). These data suggest that the 
ocs promoter is differentially expressed in different tissues 
and organs of mature tobacco plants. The data further sug- 
gest that, at the level of resolution examined in this series of 
experiments, different fragments of the ocs transcriptional 
activating sequence do not alter the expression pattern of 
ocs mRNA. Although the differing steady-state levels of ocs 
mRNA may also be explained by differential ocs mRNA sta- 
bility, similar patterns of expression were observed using an 
ocs::lacZ fusion gene (see below). It is, therefore, likely that 
the observed steady-state levels of ocs mRNA resulted from 
tissue-specific differences in expression of the ocs promoter. 

Histochemical Localization of P-Galactosidase 
Activity in Transgenic Plants Containing 
ocsr.lacZ Fusion Constructions 

To determine in which plant cell types various portions of the 
ocs transcriptional activator function, we generated trans- 
genic tobacco plants harboring translations fusions of the 
ocs gene and the Escherichia coiilacZ gene. These construc- 
tions all contained the ocs promoter but differed in the extent 
of the ocs transcriptional activator sequences employed. The 
translational fusion was the same as that used by Teeri et al. 
(1989). The constructions used are similar to some of those 
shown diagrammatically in Figure 1A, except that a gene en- 
coding an octopine synthase-p-galactosidase fusion protein 
substituted for a gene encoding the complete octopine syn- 
thase protein. 

The use of p-galactosidase activity as a histochemical 
marker in plant tissues depends upon the ability to distin- 
guish between such activity encoded by the lacZ gene and 
endogenous P-galactosidase activity present in plant tissues. 
The endogenous plant activity can be eliminated by fixation 
of plant tissues with glutaraldehyde. We found that to abolish 
endogenous p-galactosidase activity we had to modify the 
protocols of Teeri et al. (1989) (see Methods). Using such 
modifications, we could completely eliminate the endoge- 
nous activity from plants containing the construction 
pOCSA2 and, therefore, not expressing an £ coli tacZ gene 
(data not shown). In all further experiments, control plants 
containing the construction pOCSA2 were examined parallel 
to the experimental plants. Only in those instances in which 
the control plants showed no endogenous activity did we ex- 
amine further the experimental plants for /acZ-encoded 
P-galactosidase activity. For each construction, multiple tis- 
sues taken from three independent transformants were ex- 
amined. There were no differences in the patterns of P-galac- 
tosidase expression among the individual transformants for 
a given construction. 

Figure 2 shows sections taken through the leaves of trans- 
genic tobacco plants harboring the constructions pEN1, 
pAlu106, P AIRa45, and pPAL16. The second vertical column 
(Figures 2B, 2E, 2H, and 2K) shows that, in all constructions, 
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Figure 1. RNA Gei Blot Analysis of ocs mRNA Extracted from Differ- 
ent Organs and Tissues of Transgenic Tobacco Plants Containing 
Various Portions of the ocs Upstream Activating Element 

(A) Schematic diagram of the portions of the ocs upstream activating 
sequence used in each construction. The arrow indicates the orienta- 
tion of the upstream activating sequence fragment (left to right is the 
native orientation). The open box indicates the position of the 16-bp 
palindrome. The numbers above the map indicate the position rela- 
tive to the start site of transcription (+1). 

(B) Schematic representation of the tissues assayed for ocs mRNA. 
F, flower; LI to L5, leaves from the top to the bottom of the plant, suc- 
cessively; RT, root tip; RB, root base; S1 to S3, stem sections from 
the top to the bottom of the plant, successively. 

(C) RNA gel Wot analysis of ocs mRNA from representative tobacco 
plants containing the constructions pEN1, pAlu106, and pPALia The 
letters above the lanes indicate the organs or tissues, as defined in 
(B), from which the RNA was extracted. RNA gel blot analysis was 
conducted using 20 ug total cellular RNA as described in Methods. 



P-galactosidase activity expressed from the ocs::lacZ fusion 
gene was present in the leaf mesophyll cells. The activity of 
the fusion protein in the vascular tissue was strong in those 
plants harboring the construction pPAL16\ weak (and limited 
to the adaxial phloem cells) in those plants harboring the 
constructions pAIRa45 and pAlu106\ and absent in those 
plants harboring the construction pEN1. Higher magnifica- 
tion photographs of tangential sections through the leaf blade 
showed that for plants harboring any of the four constructions 
the activity of the fusion protein was present in the palisade 
and spongy parenchyma cells but not in the epidermal cells 
or in the *fctalk" cells of the trichomes. The staining of guard 
cells could be detected (e.g. f Figures 2E, 2H ( 2K, and 2L). 
Subtle differences in the staining patterns of the trichomes 
could be detected in plants containing the different ocs tran- 
scriptional activator constructions. Plants containing the con- 
struction pPAL16 showed intense staining in the cells in the 
"head" of both the glandular and nonglandular trichomes 



(Figures 2K and 2L). Staining in the head of the glandular 
trichomes was also strong in plants containing the construc- 
tions pAlu106 and pAIRa45 but was weak in the heads of the 
nonglandular trichomes. In plants containing the construc- 
tion pEN1, only the heads of the glandular, but not the non- 
glandular, trichomes showed P-galactosidase activity. Such 
lack of staining in the heads of nonglandular trichomes in 
pEN1 plants was highly reproducible. 

Figure 3 shows the expression of the ocs. JacZ fusion gene 
in the guard cells of leaves (Figure 3A) and stems (Figure 3B). 
This photograph was taken of tissue from a pPAL16 plant; a 
similar pattern of expression of P-galactosidase activity in the 
guard cells of pEN1, pAlu106, and pAIRa45 plants was ob- 
served (data not shown). Figure 4 shows the histochemical 
staining of p-galactosidase activity in the stems of transgenic 
tobacco plants. The firpt vertical column (Figures 4A, 4E, 41, 
and 4M) shows sections at the apical tip of the plant. Succes- 
sive vertical columns show sections from progressively tower 
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Figure 3. Histochemical Analysis of &-GaJactosidase Activity in the 
Guard Cells of Transgenic Tobacco Plants Harboring the Construc- 
tion pPALia 

Tissue from transgenic plants were fixed with glutaraldehyde, stained 
with X-Gal, embedded, and sectioned as described in Methods. Bars 
« 10 um. 

(A) Tangential section through the leaf blade of a pPAL16 plant show- 
ing guard cells, gc guard cell. 

(B) Section through the stem of a pPAL16 plant showing guard cells, 
gc, guard cell. 



sections of the stem. Activity can be seen in the shoot apex 
and marginal meristem for all the constructions (Figures 4A, 
4E, 4l t and 4M). Only in plants harboring the construction 
pPAL16, however, could activity be detected in the vascular 
bundle and the leaf primordium (Figure 4M). Cross-sections 
of stems from all four constructions showed that the ocsrJacZ 
fusion gene was active in both the inner and outer phloem 
cells but not in the xylem, parenchyma, or epidermal cells 
(Figures 4B f 4F f 4J, 4L f and 4N). In addition, the fusion gene 



was active in the leaf gap areas (Figures 4C, 40, 4G, 4K, and 
40) and in root primordia (Figures 4H and 4P) of plants con- 
taining all four constructions. 

Figure 5 shows that the expression of the ocs::iacZ fusion 
gene in the roots was similar in plants harboring the different 
constructions. Figures 5A, 5C, 5E, and 5G show that this ex- 
pression is restricted to the vascular cylinder and is absent 
in the epidermal and root cap cells. Cross-sections of the root 
(Figures 5B and 5H) indicate that P-galactosidase activity is 
detected in the central vascular cylinder but is absent in the 
epidermis and the cortex. Figure 5F shows that in the upper 
part of a root (>1 cm from the root tip) expression is limited 
to the vascular elements. ocsrJacZ gene activity in an emerg- 
ing root primordium can be seen in Figure 5D. 



DISCUSSION 

We have examined the organ and cellular transcriptional 
specificity conferred upon the octopine synthase gene pro- 
moter by various portions of the ocs transcriptional activating 
element. This element consists of a 16-bp palindromic se- 
quence essential for ocs gene expression (Ellis et a!., 1987a, 
1987b; Leisner and Gelvin, 1988, 1989) and surrounding se- 
quences previously shown to be important for determining 
the quantitative level of expression of the ocs gene in tobacco 
calli (Leisner and Gelvin, 1989). The data presented herein 
further demonstrate that the 16-bp palindrome is largely 
responsible for the tissue and cellular specificity of the ocs 
promoter. No p-galactosidase staining was seen in tissue 
sections from plants containing the construction pOCSA2, in- 
dicating that the 16-bp palindrome is essential to direct ocs 
promoter activity. Sequences surrounding the palindrome, 
however, limit expression of the ocs promoter in certain cell 
types. 



Figure 2. (continued). 

Leaves of transgenic plants were fixed with glutaraldehyde, stained with X-Gal, embedded, and sectioned as described in Methods Bars « 
100 urn. 

(A) Cross-section through the midrib section of a leaf from a pEN1 plant. 

(B) Tangential section through the leaf blade of a pEN1 plant showing a glandular trichoma, e, epidermis; gt, glandular trichome; pp, palisade 
parenchyma; sp, spongy parenchyma. 

(C) Tangential section through the leaf blade of a pEN1 plant showing both a glandular and nonglandular trichome. gc guard cell; gt, glandular 
trichome; ngt, nonglandular trichome. 

(D) Cross-section through the midrib section of a leaf from a pAlu106 plant, aph, adaxial phloem. 

(E) Tangential section through the leaf blade of a pAlu106 plant showing a nonglandular and glandular trichome. ngt, nonglandular trichoma 

(F) Tangential section through the leaf blade of a pAlu106 plant showing a glandular trichoma gt. glandular trichome. 

(G) Cross-section through the midrib section of a leaf from a pA!Ra45 plant, aph, adaxial phloem. 

(H) Tangential section through the leaf blade of a pAIRa45 plant showing a glandular trichome. e, epidermis; gc guard cell; gt, glandular trichome; 
PP. palisade parenchyma; sp, spongy parenchyma. 

(I) Tangential section through the leaf blade of a pAIRa45 plant showing a nonglandular trichome. ngt, nonglandular trichoma 
(J) Cross-section through the midrib section of a leaf from a pPAL16 plant, vb, vascular bundle. 

09 Tangential section through the leaf blade of a pPAL16 plant showing a glandular trichome. e, epidermis; gt, glandular trichome; pp, palisade 
parenchyma; sp, spongy parenchyma. 

(L) Tangential section through the leaf blade of a pPAL16 plant showing nonglandular trichomes. ngt, nonglandular trichome. 
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In transgenic tobacco plants, the accumulation of ocs 
mRNA is not uniform in different tissues and organs. The ex- 
pression of the ocs gene is greatest in the leaves and root 
tips, lesser in the stem and the flowers, and very low in the 
root base. These data suggest that the ocs promoter is not 
active to the same extent in all tobacco cells. The promoter 
from the nopaline synthase (nos) gene, as well as a number 
of other T-DNA-encoded genes and genes from caulimo- 
viruses, also contain essential elements displaying a high de- 
gree of homology to the 16-bp palindrome of the ocs up- 
stream activating sequence (Bouchez et a!., 1989; Cooke, 
1990). The nos promoter is not constitutive in tobacco (An et 
al., 1988). A gradient of activity resulted in higher expression 
of a nosv.CAT fusion gene in the lower leaves and stem sec- 
tions than in the upper leaves and stem sections of the plant. 
The expression of the nos promoter was also high in flower 
sections. Specific sequences in the nos promoter region 
were identified that were responsible for this gradient of activ- 
ity (Ha and An, 1989). 

Our analysis of the expression of the ocs gene in tobacco 
indicates that the ocs promoter has a different tissue specific- 
ity from that of the nos promoter. Using RNA gel blot analysis, 
we have shown that most of this differential tissue specificity 
is conferred upon the ocs promoter by the 16-bp palindrome 
portion of the ocs upstream activating sequence. RNA gel 
blot analysis was unable, however, to identify major differ- 
ences in the tissue-specific pattern of ocs gene expression 
when transcription was directed by different subfragments of 
the ocs transcriptional activating element. That the tissue- 
specific expression of the octopine synthase gene did not ap- 
pear to differ among the constructions in these experiments 
most likely reflects the low resolution of analysis afforded by 
a technique that relies upon extraction of RNA from tissues 
with many different cell types. DNA sequences surrounding 



the homology found within the ocs 16-bp palindrome oiffer 
substantially among plant promoters, however. Because we 
have shown that sequences surrounding the 16-bp palin- 
drome of the ocs upstream activating sequence can affect 
the tissue specificity of the ocs promoter, we speculate that 
the different sequences that surround the related element in 
other plant promoters may likewise affect the tissue specific- 
ity of these promoters. 

The use of a htstochemicaJ staining technique based upon 
the expression of 0-galactosidase activity directed by an 
ocs: :lacZ fusion gene afforded us a much higher level of reso- 
lution of gene expression. This technique allowed us to 
visualize the specific cell types in which the fusion gene 
could be expressed. Thus, using a construction that included 
the entire ocs upstream activating sequence, we were abte 
to detect expression of the fusion gene in the mesophyfl and 
guard cells of the leaves, the cells at the tip of glandular 
trichomes, the internal and external phloem cells and guard 
cells of the stems, procambial and vascular cells in the root, 
and cells in meristematic portions of the plant. These meri^ 
stematic regions included the shoot apex, leaf and root pri- 
mordia, and cells in the leaf gap area of the stem. Although 
different constructions incorporating various subfragments of 
the ocs upstream activating sequence did not show greatly 
differing cellular patterns of gene expression, subtle but re- 
producible differences were consistently noted. These differ- 
ences were most noticeable in the vascular tissues of the 
leaves and apical stem sections and in the leaf primordia. Ex- 
pression of the ocs::lacZ fusion gene was undetectable in 
these cells in plants harboring the construction pENl that 
contains the complete ocs upstream activating sequence. Ex- 
pression in these cells was strong in plants harboring the 
construction pPAL16 that contains only the 16-bp palindrome 
portion of the upstream activating sequence. An intermediate 



Figure 4. (continued). 

For each row, the left-most picture indicates a section through the apical meristem. Pictures progressively to the right in each row show sections 
taken from successively tower portions of the stem. Tissues were fixed with glutaraldehyde, stained with XGal, embedded and \sZ^Z 
described in Methods. Bars = 100 \im. seciionec as 

(A) Section through the apical meristem region of a pEN1 plant. Ip, leaf primordium; mm, marginal meristem 

(B) Cross-section through the stem of a pEN1 plant, e. epidermis; iph, inner phloem; oph, outer phloem; p, parenchyma- x xvlem 

(C) Cross-section through a lower portion of a stem from a pEN1 plant showing a leaf gap region. Ig, leaf gap ' ' 

(D) Higher magnification of the leaf gap region shown in (C). Ig, leaf gap. 

(E) Section through the apical meristem of a pAlu106 plant. Ip. leaf primordium; mm, marginal meristem. 

(F) Cross-section through the stem of a pAlu106 plant, e. epidermis, iph, inner phloem; oph, outer phloem; p, parenchyma- x xvlem 

(G) Cross-section through a lower portion of a stem from a pAJu106 plant showing a leaf gap region. Ig, leaf gap ' ' 

(H) Cross-section through a still lower portion of a stem from a pAlu106 plant showing an emerging root, rp, root primordium 

(I) Section through the apical meristem of a pAIRa45 plant. Ip, leaf primordium. 

(J) Cross-section through the stem of a pAJRa45 plant, e. epkiermis; iph, inner phloem; oph, outer phloem; p, parenchyma- x xvlem 
(K) Cross-section through a lower portion of a stem from a pA!Ra45 plant showing a leaf gap region. Ig, leaf gap. ' ' 

(L) Higher magnification of a cross-section through the stem of a pAIRa45 plant, iph, inner phloem; oph, outer phloem- x xylem 
(M) Section through the apical meristem of a pPAL16 plant, mm, marginal meristem; vb, vascular bundle. 

(N) Cross-section through the stem of a pPAL16 plant, e. epidermis; iph, inner phloem; oph, outer phloem; p, parenchyma- x xvlem 
(O) Cross-section through a lower portion of a stem from a pPAL16 plant showing a leaf gap region. Ig, leaf gap. ' ' 

(P) Cross-section through a still lower portion of a pPAL16 stem showing an emerging root. 
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Figure 5. Histochemical Analysis of p-Galactosidase Activity in the Roots of Transgenic Tobacco Plants Harboring Various ocs Upstream 
Activating Sequence Constructions. 



Sections were fixed with glutaraldehyde, stained with X-Gal, embedded, and sectioned as described in Methods. Bars 

(A) Longitudinal section through the root tip of a pEN1 plant, e, epidermis; rc, root cap. 

(B) Cross-section through the root of a pEN1 plant, c, cambium; e, epidermis; vc, vascular cylinder. 

(C) Longitudinal section through the root tip of a pAlu106 plant, e, epidermis; rc, root cap. 

(D) Section through the root of a pAlu106 plant showing an emerging root hair, rp, root primordium. 
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level of expression was seen in the vascular tissues in plants 
harboring the constructions pAlu106 and pAIRa45 that con- 
tain the 16-bp palindrome plus 90 or 29 bp, respectively, of 
DNA surrounding the palindrome. In addition, (J-galac- 
tosidase activity was not detectable in the cells at the head 
of nonglandular trichomas in plants containing the construc- 
tion pEN1, but expression of the ocs::lacZ fusion gene was 
readily detected in these cells from the p PALI 6 plants. It thus 
appears that DNA sequences surrounding the 16-bp palin- 
drome of the ocs upstream activating sequence limit the ex- 
pression of the linked ocs promoter to particular cell types. 
Silencer sequences that restrict the expression of the cab 
gene in nonroot cells have been identified (Simpson et al., 
1986). It should be emphasized, however, that the 16-bp 
palindrome does not render the ocs promoter completely 
constitutive; expression is still highly specific to particular 
cell types. 

Our data appear to contradict those of Teeri et al. (1989). 
These authors examined the expression of an ocsv.lacZ fu- 
sion gene in the roots of transgenic tobacco plants. Their 
analysis showed strong expression of P-galactosidase activ- 
ity in the epidermal cells of these roots but no expression in 
the procambial or vascular cells. Our analysis indicated the 
opposite pattern of expression: we detected p-galactosidase 
activity only in the procambial and vascular cells but not in 
the epidermis or in the root cap. The lacZ fusion of Teeri et 
al. (1989) used a position within the ocs gene identical to 
ours. Although it is not possible from the article of Teeri et al. 
(1989) to determine which portions of the ocs upstream ac- 
tivating sequence they included in their construction, our 
analyses, using the entire upstream activating sequence as 
well as various portions of this sequence, identified a consis- 
tent pattern of expression in the roots that was not dependent 
upon the region of the upstream activating sequence used in 
the construction. It is, therefore, unlikely that the differences 
between our results and those of Teeri et al. (1989) can be 
attributed to the use of different fragments of the ocs up- 
stream activating sequence. The reason for the differences 
in the results between the two groups cannot presently be 
explained. 

The pattern of expression directed by the ocs upstream ac- 
tivating sequence in the roots of transgenic tobacco plants 
identified in this report resembles the pattern identified by 
Fromm et al. (1989). These authors were unable, however, to 
detect expression directed by this upstream activating se- 
quence in the stem or leaves of these plants. We were easily 
able to detect expression in these tissues. Fromm et al. (1989) 



used a construction consisting of portions of the ocs up- 
stream activating sequence (including the 16-bp palindrome) 
fused to a truncated CaMV 35S promoter. This promoter/acti- 
vator combination was affixed to a GUS gene as a transcrip- 
tionaJ fusion. Our constructions contained various portions of 
the ocs upstream activating sequence, the ocs promoter, and 
the first third of the ocs structural gene. Because we used ho- 
mologous portions of the ocs regulatory and transcribed se- 
quences, our constructions most likely reflect the true pattern 
of ocs gene expression seen in transgenic plants. Indeed, at 
the level of resolution afforded by RNA gel blot analysis, our 
histochemical data and our analysis of the steady-state levels 
of RNA in the tissues of transgenic tobacco plants were con- 
sistent. It is possible that the combination of the ocs upstream 
activating sequence and the truncated CaMV promoter used 
by Fromm et aJ. (1989) resulted in a novel pattern of expres- 
sion that differed from that seen using the homologous ocs 
upstream activating sequence and promoter sequences. 

The 16-bp palindrome of the octopine synthase transcrip- 
tional activating sequence binds the nuclear protein OCSBF-1 
(Singh et al., 1989, 1990; Tokuhisa et al.. 1990). Analysis of 
the expression of the OCSBF-1 gene in maize indicated that 
the gene was most highly expressed in actively dividing and 
meristematic cells (Singh et al., 1990). Such an analysis has 
not yet been reported for tobacco. It is possible that the cell- 
specific expression of the ocs gene in tobacco reflects the 
prevalence of OCSBF-1 in certain cell types. The cell-specific 
pattern of ocs gene expression could then be explained by 
DNA sequences that regulate OCSBF-1 gene expression 
rather than DNA sequences that regulate ocs gene expres- 
sion. We feel that although it is possible that this may repre- 
sent a major level of ocs gene regulation in tobacco it cannot 
be the sole mechanism of ocs gene regulation. Sequences 
surrounding the 16-bp palindrome clearly modulate the ex- 
pression of the ocs promoter both in particular cell types of 
tobacco plants (this study) and the quantitative level of ex- 
pression of the ocs promoter in tobacco calli (Leisner and 
Gelvin, 1989). The synergistic interactions of various subdo- 
mains of the CaMV 35S promoter to yield novel cell-specific 
patterns of expression have been documented (Benfey et al., 
1990a, 1990b). It is possible that, perhaps to a lesser extent! 
the ocs upstream activating sequence is also composed of 
subdomains that when deleted or combined with tran- 
scriptional activating elements from other genes can confer 
novel patterns of expression upon linked promoters. Experi- 
ments to test this hypothesis are currently underway in this 
laboratory. 



Figure 5. (continued). 

(E) Longitudinal section through the root tip of a pAIRa45 plant, e, epidermis; re, root cap. 

(F) Longitudinal section through a region of a root of a pAIRa45 plant >1 cm from the root tip. vc, vascular cylinder. 

(G) Longitudinal section through the root tip of a pPAL16 plant, e, epidermis; re, root cap. 

(H) Cross-section through the root tip of a pPAL16 plant, c, cambium; e, epidermis; vc, vascular cylinder. 
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METHODS 



Bacterial Strains and Media 

Escherichia coll DH5a was grown in LB medium (Maniatis et al., 
1982) at 37°C. Agrobacterium tumefaciens LBA4404 and derivatives 
were grown in either AB minimal medium containing 05% sucrose 
or YEP medium (Lichtenstein and Draper, 1986) at 30°C. Antibiotics 
were used at the following concentrations (uo/mL): for E. coli- 
ampicillin, 100; kanamycin, 50; for A. tomefaoens-carbenicillin, 
100; kanamycin, 100; rifampin, 10. Plasmids were mobilized from 
E. coll to A, tumefaciens by a triparental mating method (Ditta et al., 
1980) using the mobilizing functions of pRK2013 (Figurski and 
Helinski, 1979). 



Plasmid Constructions 

Plasmids containing ocsvJacZ translation^ fusions were derived 
from the previously described plasmids pEN1, pAlu106, pAIRa45, and 
pPAL16 (Leisner and Gervin, 1989), which contained the octopine 
synthase gene and promoter and various portions of the ocs tran- 
scriptional activating element. pRS414 (Simons et al., 1987), contain- 
ing a promoterless lacZiA sequence, was digested with SnaBI (which 
cuts in the lacY gene) and a Hindi fragment was inserted that con- 
tains bidirectional polyadenylation signal sequences from the T-DNA 
0/ and V genes (coordinates, 21,727 to 22,440; Barker et aJ., 1983). 
The resulting plasmid (pRS414-polyA) was linearized at the unique 
EcoRI site preceding the tacZ gene. pEN1, pAJulOG, pAIRa45, and 
pPAL16 were digested with EcoRI (which deletes the carboxy portion 
of the octopine synthase protein and the ocs polyA signal sequence) 
and individually ligated to pRS414-polyA. The resulting plasmids 
(pENI-lac, pAlu106-lac pAIRa45-lac, and pPAL16-lac) contain vari- 
ous portions of the ocs transcriptional activating element linked to the 
ocs promoter and an octopine synthase- p-gaJactosidase transla- 
tion^ fusion. 



Nucleic Acid Manipulations 

Recombinant DNA procedures were performed according to 
Maniatis et al. (1982) under PI containment conditions as specified 
by the guidelines of the National Institutes of Health. 

RNA was extracted from plant tissue (15 plants were examined) by 
the following procedure: tissue (05-4.0 g) was frozen in liquid N 2 
and ground to a fine powder using a mortar and pestle. One milliliter 
of Tris-neutralized phenol was added, and the tissue was ground fur- 
ther while thawing. One milliliter of RNA extraction buffer (05 M Tris- 
HCI, pH 70, 100 mM Nad, 50 mM Na 2 *EOTA, and 1.0% SDS) was 
added, and the tissue was ground until it reached a homogeneous 
consistency. The mixture was transferred to a 15-mL polypropylene 
tube and centrifuged at 101000 rpm for 15 min. The supernatant solu- 
tion was removed to a fresh tube, and the pellet was reextracted with 
1 mL RNA extraction buffer. The aqueous phases were combined 
and reextracted with 2 mL phenol. The supernatant solution was col- 
lected, and nucleic acids were precipitated by the addition of 4 mL 
ethanol. Following incubation at -20°C, the nucleic acids were col- 
lected by centrifugation, dried, and dissolved in 05 mL water; then 
0.25 mL 6 M LiCI was added. The samples were incubated at 4°C 
overnight, and the RNA was collected by centrifugation. The RNA 
was dissolved in H 2 0 and precipitated by the addition of 0.1 volume 



3 M NaOAc and 25 volumes ethanol. The RNA was collected by 
centrifugation and resuspended in HjO, and the concentration was 
determined by UV light absorption at 260 nm. 

Twenty micrograms of total cellular RNA was fractionated by 
formaldehyde-agarose gel electrophoresis, blotted to nitrocellulose, 
and hybridized to a probe containing the ocs gene (a 951-bp BstNl 
fragment; coordinates, 12,602 to 13553; Barker et al., 1983), as previ- 
ously described (Karcher et al., 1984). 

Plant Transformation and Hlstochemlcal Detection of 
0-Galacto$idase Activity 

Nicotiana tabacum cv Wisconsin 38 leaf discs were infected with A. 
tumefaciens LBA4404 containing various ocsJacZ fusion construc- 
tions. Kanamycin-resistant transgenic plants were regenerated ac- 
cording to Horsch et al. (1985). Sterile plants were maintained in Ma- 
genta cubes at 25°C and propagated vegetatively by transferring 
shoot cuttings. Rooted plants were assayed for p-galactosidase activ- 
ity 4 weeks following transfer. 

Histochemical staining was performed as described by Teeri et al. 
(1989), with the following modifications. Leaf, stem, and root tissue 
of transgenic and control plants were fixed with glutaraldehyde in Z 
buffer (pH 7.4) at room temperature. For each construction, the ex- 
pression of the ocsvJacZ fusion gene was analyzed in a young leaf 
near the top of the plant and in an older leaf near the base, in the 
upper and lower parts of the stem, and in the roots. Leaf and stem 
sections were fixed in 1.0% glutaraldehyde and roots in 05% 
glutaraldehyde for 1 hr. For better penetration of the fixative, the tis- 
sue was flash frozen in liquid nitrogen and fixed again in glutaralde- 
hyde for 1 hr. The glutaraldehyde was removed by rinsing the tissue 
three times in Z buffer; then the tissues were incubated in staining 
solution (880 uL Z buffer, 50 *iL 100 mM KJFefCNJd, 50 jiL 10 mM 
KJFefCNJeJ. 8i0% X-GaJ) overnight at 28°C. The material was rinsed 
with Z buffer twice and treated with acetomethanol (1:3) for 1 hr at 
room temperature to remove chlorophyll. After rinsing in 95% ethanol 
and absolute ethanol, the tissue was embedded in Tissue Prep 2 
medium (Fisher; melting point 565°C) and cut into 20-um sections. 

Control plant tissue (a kanamycin-resistant transgenic tobacco 
plant containing the construction pOCSA2) was always prepared par- 
allel to the experimental samples. Experimental samples were ana* 
lyzed only if the control plant showed no endogenous p-galactosi- 
dase activity. 
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nesquehonite may form di recti y from the 
resultant solutions. Therefore, the observed 
mineralogy and morphology of the nesque- 
horutc and current knowledge of ncsquc- 
honitc stability arc both consistent with 
direct formation of nesquehonite under 
Antarctic conditions. The formation of ad- 
ditional nesquehonite during curarion of the 
meteorite can be understood as continued 
outward migration and evaporation of the 
saline solutions formed in Antarctica. 

Antarctic meteorites have been on the 
earth for on the order of 10 4 to 10* yean 
and have apparently been transported great 
distances by glaciers (25, 26). Whether 
chemical weathering of meteorites occurred 
during their encasement in ice or only after 
they were exposed to the atmosphere on 
stranding surfaces has been uncertain (5). 
Results for nesquehonite from LEW 85320 
suggest that, at least for salt formation, 
weathering may be sufficiendy rapid that 
most observable effects can develop in tens 
of years rather than over thousands of years. 
Rapid formation of nesquehonite suggests, 
but docs not prove, that most weathering 
phenomena occur after exhumation of the 
meteorites from deep- glacial ice. In the sim- 
plest interpretation, ages of magnesium car- 
bonate weathering products on Antaraic 
meteorites would indicate the times elapsed 
since surface exposure of the meteorites. The 
amount of rust on a surface alone cannot be 
used as a reliable indicator of the state of 
preservation of meteorites. The correlation 
between degrees of weathering and terrestri- 
al-residence ages might evade detection if 
reliance is placed solely on the rust index. 
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Herbicide Resistance in Transgenic Plants 
Expressing a Bacterial Detoxification Gene 

David M. Stalker,* Kevin E. McBride, Lorraine D. Malyi 



The herbicide bromoxynil (3,5-d^bromo^-hydroxybenzonitrilc) is a photosynthetk 
(photosystem II) inhibitor in plants. A gene, bxn> encoding a specific nitrilase that 
converts bromoxynil to its primary metabolite 3,5-dibromo-4-hydroxybenzok acid, 
was cloned from the natural soil bacterium Klebsiella ozatnae. For expression in plants, 
the bxn gene was placed under control of a light-regulated tissue-specific promoter, the 
ribulosc bisphosphate carboxylase small subunit Transfer of this chimeric gene and 
expression of a bromoxynil- specific nitrilase in leaves of transgenic tobacco plants 
conferred resistance to high levels of a commercial formulation of bromoxyniL The 
results presented indicate a successful approach to obtain herbicide resistance bj 
introducing a novel catabolic detoxification gene in plants. 



ADVANCES IN GENETIC ENGINEER- 
ing of plants have allowed the devel- 
opment and transfer of agronomi- 
cal!}' important traits such as viral resistance, 
insect resistance, and herbicide resistance. 
These traits allow transfer of single domi- 
nant genes that exhibit a rapidly discernible 
phenotypc. Herbicide resistance can be 
achieved by at least three different mecha- 
nisms: overproduction of a herbicide -sensi- 
tive biochemical target; structural alteration 
of a biochemical target, resulting in reduced 
herbicide affinity; or detoxification-degrada- 
tion of the herbicide before it reaches the 
biochemical target inside the plant cell. 
Resistance obtained by the first two mecha- 
nisms has been developed for the herbicides 
gryphosatc (1; 2), atmine* 4 ^ the? SuHbrr?-' 
lurcas (4), and phosphinothricin (5). A re- 
port describing the transfer to plants of a 
Strep tomyces gene encoding a phosphino- 
thricin aceryltransfcrase (6) resulted in phos- 
phinothricin- resistant plants, establishing 
detoxification by conjugation as a viable 
strategy. Two advantages of a detoxifica- 
tion-degradation mechanism, as opposed to 
altering a biochemical target, are that spe- 
cialized compartmcntation of the detoxify- 
ing activity is not required and that greater 
herbicide resistance can be achieved with 
lower levels of detoxifying enzyme. Disad- 



vantages include the potential toxicity of 
one or more metabolites and the possibility 
that detoxifying activities might react with 
endogenous plant compounds to impair 
plant function. 

We have been interested in the nitrilc- 
containing broadleaf herbicide brofnoxynil 
(S^^bronx^hyctoxybcnzonitrile), a po- 
tent photosystem II (PSH) inhibitor. Al- 
though the actual chk>roplast-localized bio- 
chemical target is not well defined, there is 
evidence that bromoxynil acts by binding a 
component of the quinone-binding protein 
complex of PSII, inhibiting electron transfer 
(7, 8). It has further been suggested that a 
low- affinity binding site within this complex 
exists in the (ft W 

environment, as microbial populations and 
tolerant plant species can convert the cyano 
moiety of bromoxynil to the corresponding 
amide and acid derivatives (tt-13). A nam- 
• raj soil isolate, Klebsiella ozaenae, has been 
identified that transforms bromoxynil to 
3,5-djbromo-4-hydrox\ r ben2oic acid, releas- 
ing ammonia (14). This reaction (Fig. 1) is 
carried out via a bromox\Tul-specific nitri- 
lase. The nitrilase gene (bxn), which is ptas- 
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mid -encoded in K. t'c.joue, was cloned on a 
2,f>-Lb ON A fragment .ind expressed in 
l^iinnJtui ' 15). The nucleotide sequence 
of a 1.2-kb DNA segment was subsequently 
determined and shown to encode a 38-kD 
polypeptide ( I'>). The bromnvvnil- specific 
nitnlasc * pec i tied by rhis fragment was puri- 
fied to huiin penury and characterized i /'n. 
A chimeric ii\n gene lus now been con- 
structed tor expression in plants. The plant 
promoter used rbr this chimeric gene is 
derived from a tobacco light -inducible tis- 
sue-specific ribulose: bisphosphace (RuBP) 
small subunit gene ( Genes from the 
RuBP small subunit family have been shown 
to be expressed only in photosvnthetic tissue 
* IS. /Vi. The rationale for using rhis pro- 
moter us the follow ing. If the primary target 
of bromoxynil is photosynthesis and if the 
herbicide is not translocated within the 
plant, then detoxification of* bromoxynil in 
green tissue should be a minimal require- 
ment to obtain whole plant resistance. 
Transgenic tobacco plants expressing the 
bromoxynil-specific nirrilase in photosvn- 
thetic tissues are resistant to high doses of a 
commercial formulation of bromoxvnil, and 
die trait is it ably inherited in the succeeding 
generation. 

Xiiotutu tabacum cv. \\anthf plants were 
maintained axchkrally through shoot trans- 
plants and used as tissue dooors. Disks (2. 
mm in diameter) were excised from young 
leaves and placed in Murashige and Skoog 
medium containing indoieacetic acid { 2 mg/ 
liter) and kinetin (2 mg/liter) and placed in 
the dark at 23°C. Agrobaclerium tumejaciens 
strain LBA4404 (20) containing the shuttle 
vectors pBrx39 or pBrx40 (21) carrying 
chimeric genes for expression in plants were 
grown overnight and added to the leaf disks. 
Cocul fixation, selection of transformed 
shoots on kanamycin (100 mg/liter), and 
regeneration of plants were essentially car- 
ried out as described (22). Plants were trans- 
ferred to 10-cm pots containing soil and 
maintained in a growth chamber (25'C 
50% relative humidity, and 16-hour photo- 
period) or under greenhouse conditions. 

A number of independent kanamycin- 
resistant transgenic plants generated by co- 
nation were selected for analysis. Leaf 



timings were obtained horn individual 
plants and, in two separate experiments, 
were maintained phot oaumtroph teal !\ in 
rhc presence of increasing bromoxynil con- 
centrations; chlorosis was monitored (Fig. 
2). Gmtrol tissue is normally bleached at 
10 h M to 10'-.V/ bromoxynil. Leaf tissue 
from a number of transformed plants (39-3, 
40-2. 40 4. 40-6, and 40- "i appeared nor- 
mal at 10 \W bromoxynil. and tissue from 
plants 40- 1 and 40-5 appeared normal at 
10 4 \t bromoxynil. Plants 39-2 and 40-3 
were bleached at the same rate as control 
dimples and subsequently were shnwn not 
to Ix* transformed. 

To further characterize these plants with 
respect to "integration of the chimeric bxri 
gene and expression of the bromoxvnil- 
spcuric nitrila.se. we analyzed individual 
plants indued to be broniownil- resistant 
from the leaf section experiment. DNA blot 
analysis of six of the transformed plants 
indicated that these plants conrained copies 
of h.xn gene sequences ( Fig. 3). Five of the 
plants contained the intact 4.5-kb Hind III 
fragment, the expected size of the small 
suhunit-/>.v/i-tx*topinc synthase (<»o) chi- 
meric gene. Plants 40-1 and 40-4 had an 
extra DNA segment " containing bxn se- 
quencing, and plant 40-5 had DNA frag- 
ments of two different sizes, indicating rear- 
rangements of the chimeric gene during 
.4yro/wrtTiMM- mediated transfer and integra- 
tion into plant DNA. Copy number analysis 
indicates that one to three copies of the hxn 
gene arc present, depending on the trans- 
genic plant analyzed. Leaf samples from 
these selected plants were subjected to im- 
munoblot analysis (Fig. 3). Plants 40-1, 40- 
5, and 40-7 had high levels of the 38-kD 



nitnlasc polypeptide, whereas piants 40-4 
and 40 6 had much low er locls of nitnlasc. 
repression of the enzyme in plant 39-3 is 
\ cry low and can only be detected by overex- 
posing the blot. Antiserum to the bromov 
ynil-speciric nitrilase is not cross-reactive 
with any major plant components (although 
a nonspecific interaction of the immuno- 
globulin G fraction can be scent. The ap- 
pearance of only the 3tS-kD polypeptide and 
ix> degradation products indicates stability 
of this polypeptide in plant cells. These 
differential protein levels observed for the 
independent transtornunts correlate with 
the level of bromoxvnil resistance obtained 
in the leaf sea ion assay described in the 
legend to Fig. i. Leaf tissue Trbm plants 40- 
1 and 40-5 contained the highest levels of 
nitnlasc. and no chlorosis was observed at 
10 J .W bromoxvnil. These data suggest that 
small subunit promoter-controlled expres- 
sion of the bromoxynil-specific nitrilase in 
leaf tissue results in protection of leal' tissue 
from photosvnthetic damage. Messenger 
RNA levels were determined for one of 
these transgenic plants (40-1). Chimeric 
mRNA of the expected size (1.8 kb) was 
observed at a high level in leaf tissue, a low 
level in stem tissue, and was undetectable in 
roots UM). This finding indicates that the 
small subunit promoter was runcrioning in a 
tissue-specific manner. 

Four of the primary (T|) transformanrs 
were subjected to both open rxjflination and 
backcross experiments for genetic analysts 
tests. These transformants were chosen be- 
cause thev exhibited a range of nitrilase 
levels and contained a non rearranged 4.5-kb 
small subunit-/).YH-oc> chimeric gene. Plants 
(T:) were grown from seed and subjected to 



Table 1. Inheritance of the bxn gene in transformed plants* Approximately 200 T : progeny plants were 
obtained from both open pollination (OP) and backcross expcnmcQts of four primary (T t ) transfor- 
mants and analyzed by spraying at the three to four leaf stage with a commercial formulation of 
bromoxynil { Buctril) at 0.5 lb per acre. Segregation ratios and \' values were calculated on the basis of 
the expected values. Qualitative determination of nitrilase levels was derived from scanning protein 
immunoblots of T, transformed leaf tissue ( Fig. 3). Values axe expressed as a percentage of total leaf 
protein. Approximately 300 T 2 progeny plants were obtained by open pollination of the original plant. 
Plants were divided into lots of 30 each and sprayed with the following concentradons of Buctnl: 0, 
0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, and 4.0 lb per acre. Maximum resistance level was detcnained as the * 
concentration at which chbrotic lesions appeared. Scgnegantt that did not carry the gene were not 1 
scored. 
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• »:m. in :i<mi *e;*vted rrai>;. piar.iv ^rtavc Nicnu/cd. placed in the 

c<mk durations ot hrumoxynii indicated, and maintained photoautotrophi- 



. .iiii r- siv u,is m«<r.iti»ri\l. 1 M ;« »t* cr t pi^ -Acre raxen II da*. > arte 
inn.atien ui tlx- experiment. .V/. 'XanuV refers to control leaf sections. 



genctic analysis by spraying with a commer- 
cial formulation of bromoxvnil fBucrriK 
Rhone- Pouicnc) at 0.5 ib kt:; per acre 
(normally a high Held rate:. Another >ct ot 
plants was subjected to Buctril spraying at 
0 5 !b per acre increment* to dcrcrv.'nc 
maximum ie\el of toleraiKe before tiu^riitic 
legion* appeared i Table 1*. The bmrnowni! 
resistance trait segregated a* a single genetic 
locus in progeny of piano 39-3, 40-6, and 
40-7, and as two independent loci in progc- 
nv of 40-1 in both open pollination and 
backer* iss experiments. The maximum Buc- 
tr:l resistance levels exhibited bv the pn -gem 
of these tour plants also correlated with the 
protein :c\ei observed in leaf tissue » *b:^:::ed 
from the respectiw* primary T| :ra;>for- 
mant. Progeny from 39-3. which contained 
the lowest level of nitrilasc, exhibited symp- 1 
toms at lower rates (1.5 to 2.0 Ib per acre), 
whereas progeny of 40-1, which contained 
high nitrilase levels, exhibited no symptoms 
even at 4.0 lb per acre. Thus result suggests 
that enzyme levels determined in the pri- 
mary' trans forrrunts arc maintained in their 
progeny and that the greater the level of 
nitrilase in the leaves of transgenic plants, 
the higher the level of bromowni! resist- 
ance. Selected plants from the pmgeny T : ; 
i if plain 40-1 that were graved with br<>- 
moxynil at increasing rates are shown in Fig. 
4. These transgenic plants expressing nitri- 
lase grow, flower, and set seed normally, 
even when sprayed wiih concentrations of 
bromoxyni! eightfold higher than the high- 
est field rate normally used. The enzyme 
appears to be localized in the plant cell 
cytosol, as chJoroplasts prepared from leal* 
tissue contain no detectable nitrilase poly- 
peptide nor can the bromox\*niUpecific ni- 
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Fig. 3. (A) PUnt DNA A 
was prepared as de* ^ 

bribed .:*) from l-g 
leaf samples and digested 
to completion with 
Hind 111, and 5 to 7 • 
of rural planr DNA were 
cu.\:n ipimrcsed in a T'n 
agarose gel. The DNA 
vvj^ rraiMerrcd to nitro- 
cellulose, and the bloc 

was hybridized with a . - . 

1.2 kb Bam HI-Eco RJ 
''ah gene fragment la- 

:vx\fuith ,: P\jVi. \r 
reterv to DNA isolated 
tmm m intransforme J 
tobaeeo tissue Control 

oMMjin* rhe isolated 4. .v Lb H:;\i MI fraiinvr.t n\o copies per genome equivalent) added to DNA 
elated from nonrramfornx*: :*u:\t. B Imimmt chemical detection ot rhe ciu\mc from leaf tissue ot 
^eie^ted transforrnams. filottinu procedure i\ de^nbed in <JfK^:. Control indicates non transformed 
leaf tissue. The second lane contains 200 ng of purified nitnlase t /e>> spiked into a nontranstbrmed leaf 
sampkapd>nrnui>^^ . _ ^, ^ .«.^> 



Fig. 4. Phenotypc of transformed plants. T ; seed from primary transform ant 40- 1 was grown in 10-em 
pocs (three plants per pot) to the three to four leaf stage and sprayed with a commercial formulation of 
bromoxynil (Buctril) at die amounts indicated. Spray blank contained the formulation without the 
active ingredient. The photograph was taken 6 days after spraying. 
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Sxutu nitiiM^- in ph..(..MinlK-iu ii-mk- ..I 
;r.iiiNUcnk puini% mmiKts hich*Ic\cl roiM- 
.uucro brnmmvml. The herbicide in am- 
wrtcil to a n.'iH.'\:v mci.iN ilitc. .v.VJi- 
;v nni»-4-ln Jn»\\lvn/MK ku). h\ :hc cri- 
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(jppri).\inuu]y U.O()0~"u toui k at pnucm- 
is required to obtain high commercial resist- 
ance levels: the higher the expression level »»t 
the -ire.uer ;!v decree • >* roist.iiue 
tii.it can K .icmcvcii airhnuuh ihi* :*» -j 
direct linear relation). Expression of the 
enzyme in plants can be achieved under the 
control of a tissue- specific light -regulated 
promote! i<- main ".\ hole punt resjsi.UKc 
lV»tccfi<«n ot rhe ' 
>\ ^;^;u il lar^e: 
obtained b\ expressing die bromo\\nil-spc 
citic nitrilase in a separate cellular compart- 
ment (cvtosoh. Tlie trait is dominant and 
heritable, and the enzyme levels obtained in 
rhe primary transformants relate to bromox- 
vnil resistanee lexeN observed in the suc- 
ceeding generation. These ob\crvar ions sug- 
gest that in the case of the bromonTiil- 
MH-cilic n!t:::a^e -he phosplun< 'thricm 
jcerA'ltransteraM: e>i. dero\iricatioivdegrada- 
t it hi is superior to altering a bn •chemical 
target to obtain herbicide resistance in 
plants by gene transfer technology. Complex 
pathways and mutant proteins and genes do 
nut have to be cngi::*e:vd ;v-r are special- 
ized targeting functions tor specific cellular 
etimpanments needed. The sole require- 
ment consists of a single step conversion of 
the herbicide to a iviucw tr.euiNv.ic. 

Although exteiisixe analysis <>t residues in 
these transgenic plants has not been com- 
pleted, it will be interesting to determine the 
fate of the metabolite (3,5-dibromo-4-hy- 
droxy benzoic acid) produced by the action 
of nitrilase. If the bromine atoms of 3,5- 
dibromo-4-hydrovy benzoic acid are hydro- 
lyzed by an endogenous plant dehaJogenasc, 
then the product of this reaction would be p- 
hydroxybenzoic acid, a norma] constituent 
of plant metabolism. In this case, no metab- 
olites of bromoxvnil would be detected. 
Alternate civ. 3.5-dibromo-4-hvdrovvben- 
zoic acid could be- concentrated in the vacu- 
ole or transported out of the cell or both 
may occur. The brorrKocy nil-specific nitrilase 
is active in vitro only as a dimcr (16). 
suggesting that this polypeptide must un- 
dergo formation in the plant cell into an 
active complex. Supporting this phenome- 
non is the appearance of a faint undenarured 
band at approximately 72-kD on immuno- 
blots (25). An analogous result was observed 
on simultaneous expression of the bacterial 
'•o A and luxB genes in transgenic plant* 



.'». -f de:%*i'» ■ •* in uVi \k:Ier.iM' 
1 in -v *v^;i:in iii! » * 'i n '\u ti n.u pr« »ten^ 
.in !ir:n uin, i n. ■ *ne*. in a dnersc plain 
veil ein ir« »miik nt 

rhe ^ene eiK«»viing the broiitt »\\ -ml- spe- 
cific nitrilase can be used as a selective 
marker tor rapid greening at the whole 
plant Ie\el I. eat seU;«-u^ ^.ui be excised from 
rhe plant and pin >toaut« itrophKal- 

Iv. or tndivkiuai leaves van K* treated unh 
commercial bromoxxnil-tormulations with- 
out harm to the remainder ot the plant. At 
hiuh hri:mownil ^< »nccni rat ions, svmpmms 
,.in Ix ^»»tcd within 24 to m> hmirs atui 
application. Finally, expression of the K. 
ozaenae bromoxynii- specific nitrilase in 
transgenic plants to obtain a bromoxynil- 
resistan; -r\ t - " uioiher example that 

at:n»iv»iniva;l\ niuvrr.-ni uernnMa<m can Iv 
der:\^: :;»*=:i t "'■ '"..-.am m-ii:^ aiui thai 
novel and diverse biochemical pathways can 
be introduced into plants without deleteri- 
ous effects. 
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A cDNA clone for a cysteine proteinase inhibitor of 
rice (oryzacystatin) was isolated from a XgtlO cDNA 
library of rice immature seeds by screening with syn- 
thesized oligonucleotide probes based on partial amino 
acid sequences of oryzacystatin. A nearly full-length 
cDNA clone was obtained which encoded 102-amino 
acid residues. The amino acid sequence of oryzacysta- 
tin deduced from the cDNA sequence was significantly 
homologous to those of mammalian cystatins, espe- 
cially family 2 cystatins. Oryzacystatin contained the 
sequence Gln-Val-Val-Ala-Gly conserved among most 
members of the cystatin superfamily. The gene for 
oryzacystatin was transcribed into a single mRNA spe- 
cies of about 700 nucleotides. The content of mRNA 
reached its highest level 2 weeks after flowering and 
then gradually decreased to undetectable levels at 10 
weeks. This feature of transient expression is coordi- 
nate with that of glutelin (a major storage protein), 
although the expression of oryzacystatin precedes that 
of glutelin by about 1 week. 



Proteinaceous cysteine proteinase inhibitors of animal ori- 
gin have been extensively investigated to clarify their chemi- 
cal structures and physiological functions. These inhibitors 
are considered to have evolved from a cognate ancestral gene 
and form the cystatin superfamily which is further classified 
into three different families on the basis of their molecular 
structures (1). Family 1 cystatins lack disulfide bonds; human 
cystatins A (2) and B (3) and rat cystatins a (4) and 0 (5) are 
typical examples. Family 2 cystatins contain two disulfide 
bonds as exemplified by human cystatin C (6), human cystatin 
S (7), bovine colostrum cystatin (8), and chicken cystatin (9). 
Cystatins of both families are also characterized by their 
molecular weights that range from 10,000 to 20,000. Family 3 
cystatins comprise kininogen segments (10). The structures 
of various cystatins of these three families have been eluci- 
dated by amino acid sequencing or by cDNA cloning. All the 
elucidated cystatins conserve the sequence Gln-Val-Val-Ala- 
Gly or its homologues (11) which might be involved in the 
inhibition of cysteine proteinases (EC 3.4.22). 

However, no detailed information was available about plant 
cystatins. We purified a proteinaceous cysteine proteinase 

* The costs of publication of this article were defrayed in part by 
the payment of page charges. This article must therefore be hereby 
marked "advertisement" in accordance with 18 U.S.C. Section 1734 
solely to indicate this fact. 

The nucleotide sequence(s) reported in this paper has been submitted 
to the GenBank™/EMBL Data Bank with accession number(s) 
J03469. 



inhibitor from rice seeds (12) and characterized it as a cystatin 
(13). In the present paper we name this protein oryzacystatin. 

It would be interesting to compare the amino acid sequence 
of oryzacystatin to those of animal cystatins in order to find 
out if any homology exists. The comparison may also provide 
information useful to clarify the structure-function relation- 
ships of oryzacystatin. 

For this purpose we cloned and analyzed cDNA encoding 
oryzacystatin. We then analyzed the change in the amounts 
of corresponding mRNA during the ripening of rice seeds. 
The change in the amount of the mRNA encoding rice glutelin 
was also investigated in order to understand the phyto- 
physiological function of this inhibitor. 

EXPERIMENTAL PROCEDURES 1 

Materials — Escherichia coli DNA ligase and amino myeloblastosis 
virus reverse transcriptase were obtained from P-L Biochemicals and 
Bio-Rad, respectively. DNA polymerase I from E. coli, its Klenow 
fragment, terminal deoxynucleotidyltransferase, bacterial alkaline 
phosphatase, and T4 polynucleotide kinase were purchased from 
Takara Shuzo Co. The restriction enzymes used were products of 
Takara Shuzo Co., Toyobo Co., and P-L Biochemicals. The oligo(dT)- 
cellulose was a product of P-L Biochemicals. A nick translation kit 
and [-y-^lA TP ( 3000 Ci/mmol) were obtained from Amereham 
Corp. [a-^PJdCTP (3000 Ci/mmol) was obtained from Du Pont-New 
England Nuclear. 

Rice Seeds — The rice cultivar Nihonbare (Oryza sativa L. japonica), 
a middle ripening variety, was used. The plant was grown in the 
Experimental Farm at the University of Tokyo, and the seeds were 
harvested at 1-week intervals after flowering. The seeds were scepti- 
cally removed from the pods, frozen in liquid nitrogen, and stored at 
-80 # C until used. 

Determination of Partial Amino Acid Sequence and Designing of 
Oligonucleotide Probes — Oryzacystatin was purified from matured rice 
seeds as described (12). The purified oryzacystatin was digested with 
metalloendopeptidase (EC 3.4.24) (Grifolafrondoza, Seikagaku Kogyo 
Co.) (14) and the digests were separated on a Vydac ODS column (for 
details see Miniprint). Two sets of oligodeoxynucleotide mixed probes 
(A and B) were used. Probe A was based on the partial amino acid 
sequence of peptide fraction 6 and probe B on that of peptide fragment 
1 (Table IS). They were synthesized with a DNA synthesizer (Applied 
Biosystem Inc., model 380B) and labeled at their 5' ends with [7- 
**P]ATP and T4 polynucleotide kinase in a buffer containing 25 mM 
Tris-HCl (pH 9.0), 5 mM MgCl,, 5 mM dithiothreitol, and 25 Mg/ml 
bovine serum albumin. Probes A and B are a 16 mix of 17-mers and 



1 Portions of this paper (including part of "Experimental Proce- 
dures," part of "Results," Figs. 1S-4S, and Table IS) are presented 
in miniprint at the end of this paper. Miniprint is easily read with 
the aid of a standard magnifying glass. Full size photocopies are 
available from the Journal of Biological Chemistry, 9650 Rockville 
Pike, Bethesda, MD 20814. Request Document No. 87M-1837, cite 
the authors, and include a check or money order for $2.80 per set of 
photocopies. Full size photocopies are also included in the microfilm 
edition of the Journal that is available from Waverly Press. 
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a 1024 mix of 17-mers, respectively, their sequences being shown in 
Fie 2S 

Construction ofacDNA Ubrary-ToUl RNA was extracted by the 
Dhenol-SDS 2 method (15) from rice seeds harvested at 2 weeks after 
flowering. Poly(A*) RNA was purified by oligo(dT ) -cellulose cohamn 
chromatography (16). After the synthesis of single-stranded cDNA 
by reverse transcriptase with total poly(A*) RNA as a template, 
double-stranded cDNA was synthesized essentially according to the 
method of Gubler and Hoffman (17) using E. coli polymerase I. Flush 
ends of the cDNA were generated with T4 DNA polymerase, and the 
double-stranded cDNA was treated with EcoRI methylase. Phospho- 
rylated £coRI linker (12-mer) was ligated, digested with EcoRl, and 
inserted into the phage vector XgtlO. The DNA was then packaged 
into bacteriophage particles using the packaging extract uriu-r 
(Stratagene) and grown on E. coli strain C600HA- The yield was 5 X 
10* plaques/^g of mRNA. 

Isolation of cDNA Clones for Oryzacystatin-Recombinant plaques 
were transferred onto nylon filters (Hybond N, Amersham Corp^ 
The filters were fixed with ultraviolet light and prehybndized for 20 
h in 50 mM Tris-HCl (pH 8.0) containing 1 M NaCl, 10 mM EDTA 
0.2% Ficoll 400, 0.1% SDS, 200 >ig/m\ heat-denatured E. coli DNA 
and 200 jig/ml salmon sperm DNA at 49 *C for probe A or at 43 C 
for probe B. After the prehybridization, the filters were hybridized 
with 5' end-labeled probe A or B for 20 h at 49 or 43 *C, respectively, 
in the same solution used for the prehybridization. The ^ers were 
finally washed in 6 x SSC (1 x SSC is 0.15 M NaCl and 0.015 M 
sodium citrate), 0.1% SDS at 49 or 43 'C for probe A or B, respec- 
tively, and exposed to Fuji RX film with intensifying screens at 
-80 *C The positive plaques selected on the first screening were 
further screened with probes A and B under the same conditions used 
in the first screening. J ^ 

Nucleotide Sequencing-The DNA was extracted from the recom- 
binant phages, digested with EcoRI endonuclease, and subcloned into 
the plasmid vector pUC18. Recombinant plasmids were introduced 
into E coli MM294 as described by Hanahan (18). Colorues contain- 
ing the cDNA insert were selected by hybridization with P-labeled 
probe A. The cDNA inserts of both strands were sequenced in their 
entirety by the chain-tennination method of Sanger et oL (19) and 
by the chemical modification method of Maxam and Gilbert (20). 

RNA Blot Hybridization-Total RNA was extracted from nee seeds 
at various stages of maturation as described above. The RNA sample 
(2 tig in total RNA) was denatured and electrophoresed in a formal- 
dehyde-containing agarose gel (21). After electrophoresis the RNA 
was transferred to a nitrocellulose membrane (Schleicher & Schuell) 
and hybridized with nick-translated cDNA (21) at 42 "C m a solution 
containing 50 mM sodium phosphate (pH 7.0), 5 x SSC, 50% (v/v) 
formamide, 0.1% bovine serum albumin, 0.1% polyvinylpyrrohdone, 
0 1% Ficoll 400, 0.2% SDS, and 200 ng/m\ heat-denatured salmon 
sperm DNA. The filter was finally washed with 0.1 x SSC containing 
0.1% SDS at 42 °C and then exposed. . 

Genomic DNA Blot HybridUation-Rice genomic DNA was isolated 
from seeds essentially according to the method of Maniatis et a/. (21). 
About 6 fig of the DNA was digested with various restriction enzymes, 
fractionated on agarose gels, denatured, and transferred to ► nitrocel- 
lulose membrane according to the method of Southern (22). Filters 
were prehybridized at 65 *C under the same solution as used for the 
first screening of cDNA cloning and hybridized at 65 °C in a solution 
containing denatured nick-translated probes of ojryzacystatin or the 
glutelin (23). The filters were washed with 0.1 X SSC containing 0.1% 
SDS at 65 °C. 

RESULTS 

Isolation and Assignment of a cDNA Clone Encoding Ory- 
zacystatin- From 1.2 x 10 5 independent cDNA clones of a. 
rice seed XgtlO cDNA library, 12 clones were identified that 
hybridized with the ^-labeled probe A (Fig. 2S). Of these, 
four clones were also positive to probe B (Fig. 2S) and sub- 
jected to further analyses. Since the clone designated as 
XOC26 had the longest cDNA insert, its sequence was deter- 
mined by the strategy shown in Fig. 3S. Fig. 1 shows the 
complete nucleotide sequence of 598 base pairs which con- 
tained the sequences of both probes A and B. It also contained 
the poly (A) addition signal AATAAA in the 3' region, but 

2 The abbreviation used is: SDS, sodium dodecyl sulfate. 



-li 



CCCCCAOTCXCATCCCCCACGCCCAGAACCCCACCAGAACATCTCCACCCAC 

MetSerSerAsp 
1 

50 

GGAGGCCCGCTCCTTGCCGGCCTCG^^ KCT< = 
GlyClyProValLeuClyGlyValCluProValGlyAsnCluAsnAspLeuHisLeuVal 



GACCTCGCCCCCITCGCCGTCACCCACCAC 
AspUniAlaArqPJ^A^y^T^CJu^AsnLys 

1 50 

GAGAAGCTTGTCAGTGTGAAGCAGCAAGTTCTCGCTGGCACTTTC 
GluLysUeuValSerValLysGlnGlnValValAlaGlyThrbeuTyrTyrPheThrlle 



CAGGTGAAGGAAGCGGATGCCAAGAAGCTCTATGAAGCTAAGGTCTC 
ni.,VAli.ysGl U GlvAspAlaLysLysl^uTyrGl^Ma^y^lT£p^luLysProTrp 

300 

* tvir: * rTrr* * a Anr"TXX!AGG AGTTCAAGCCTGTCXS ATGCCAGTGCAAATGCCT AAGGC 
MetAspPheLysGluLeuGlnGluPheLysProValAspAlaSerAlaAsnAla*** 



250 



CCATCTCCATCCTATGTGTATCAAGTTATCTTCTTGAT^^ 
400 

TAGCTATTGGACATCTTAATTATCCACATGATAATAT^ 
450 

OGATAATATGGCTTX^ATATATAGCTATAAAGATTTACCTATGGCATATTCAATGTGTAT 



500 



550 



tactactaagtaagaatgattccaaggtctattaactacaaa™ 

TGTTAC 

Fig. 1. The nucleotide sequence of oryzacystatin cDNA in- 
sert and deduced amino acid sequence. The predicted amino acid 
sequence is shown below the nucleotide sequence. The nucleotides 
are numbered in the 5' to 3' direction, beginning with the first 
nucleotide of the initiation methionine codon ATG; the nucleotides 
on the 5' side of nucleotide 1 are indicated by negative numbers. The 
amino acid sequences of two peptide fragments for the synthesis of 
probes are upperlined and other isolated peptides fragments are 
indicated by arrows. The putative poly(A) addition signal sequence is 
underlined. 

lacked a poly(A) tail. Since one of the six possible reading 
frames of both strands contained all the six partial amino 
acid sequences of oryzacystatin (Table IS), we concluded that 
clone XOC26 encoded oryzacystatin in the reading frame 
shown in Fig. 1. The putative initiation codon (ATG) must 
be the real initiation codon for the following reasons. 1) The 
length of the mRNA (about 700 bases) estimated as in Fig. 3 
roughly corresponded to the total length of the cDNA insert 
of XOC26 and poly(A) tail. 2) Both the molecular weight 
(11,341) and the amino acid composition (Table I) calculated 
from the deduced amino acid sequence are consistent with 
experimental values (M r 12,000 from SDS-polyacrylamide gel 
electrophoresis and Table I) (13). 

Amino Acid Sequence Homology— Comparisons of the de- 
duced oryzacystatin sequence with all proteins in the NBRF 
database (released Nov. 1986) showed significant sequence 
homologies with proteins of three cystatin families (Fig. 2). 
For examples, in the region of residues 8-72 (oryzacystatin 
numbering), oryzacystatin had a homology of 30.3% with 
human cystatin A. Residues 8-66 of oryzacystatin was 30.6% 
homologous with human cystatin C. Residues 26-81 of ory- 
zacystatin also showed a 23.2% homology with human kinin- 
ogen segment (residues 130-251). Incidentally, it is notewor- 
thy that the conserved sequence Gln-Val-Val-Ala-Gly among 
several of the animal cystatins exists in oryzacystatin (Fig. 

2). . . 

Expression of Oryzacystatin mRNA during the Ripening 
Process of Rice Seed— The cDNA insert of XOC26 was used 
as a hybridization probe to evaluate the expression of oryza- 
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cystatin mRNA. As shown in Fig. 3, a single band of approx- 
imately 700 bases was detected for the mRNA species at all 
the maturation stages investigated. The content of oryzacys- 
tatin mRNA varied considerably during development. The 
mRNA for glutelin, a single band of about 2.0 kilobases, 
showed similar changes during ripening of the rice seeds. 
However, maximal expressions of the two mRNAs were ob- 
served at different stages; the oryzacystatin mRNA was ex- 
pressed maximally 2 weeks after flowering, whereas the 
amount of the glutelin mRNA reached a maximum at 3 weeks. 



Identification of the Genes for Oryzacystatin— To identify 
the genes for oryzacystatin and the glutelin, chromosomal 
DNA of rice seeds was digested with various restriction en- 
zymes and subjected to a total genomic Southern experiment 
using cDNA probes of oryzacystatin (XOC26) and the glutelin 
(AOG12). Fig. 4 shows profiles obtained in both cases. £coRI 
and Pstl digestion for oryzacystatin {lanes 2 and 5) and £coRI 
and Hindlll digestion for the glutelin (lanes 8 and 70) showed 
essentially single bands, although in both cases some minor 
bands were detected as well. 



Table I 

Amino acid composition of oryzacystatin 



Amino acid 


Nucleotide sequence* 


Experimental 


residue 


Residue 




value* 








mol% 


Asp 


6 


5.9 


10.6 


Asn 


5 


4.9 




Thr 


3 


2.9 


3.9 


Ser 


5 


4.9 


4.8 


Glu 


11 


10.8 


13.5 


Gin 


3 


2-9 




Pro 


4 


3.9 


4.8 


Gly 


7 


6.9 


7.7 


Ala 


9 


8.8 


9.6 


Cys 


0 


0.0 


0.1 


Val 


12 


11.7 


10.6 


Met 


2 


2.0 


1.0 


He 


1 


1.0 


1.8 


Leu 


10 


9.8 


9.6 


Tyr 


3 


2^ 


1.9 


Phe 


5 


4.9 


4.8 


Lys 


11 


10.8 


9.6 


His 


2 


2.0 


1.9 


Arg 


1 


1.0 


1.9 


Trp 


2 


2.0 


1.9 


Total 


102 


100.0 


100.0 



"The residue numbers and molar percent values are calculated 
from the sequence deduced (Fig. 1); the initial methionine is included. 
6 From Abe etal. (13). 



DISCUSSION 

In the present study we first isolated a cDNA clone encoding 
oryzacystatin. It is composed of 598 base pairs, lacking a 
poly(A) tail. A Northern hybridization study, on the other 
hand, showed that oryzacystatin mRNA was approximately 
700 bases in length (Fig. 3). Taking into consideration the 
length of the poIy(A) sequence existing in the mRNA, we 
estimate that the cDNA obtained could represent an almost 
full-length copy. 

Oryzacystatin shows significant sequence homology with 
animal cystatins (Fig. 2). In the sense that oryzacystatin bears 
no disulfide bond, it could be classified as family 1. However, 
in terms of the numbers and location of identical amino acid 
residues (Fig. 2), it seems that oryzacystatin more closely 
resembles family 2 than family 1 cystatins. For example, the 
sequences Phe-Ala-Val (residues 29-31) and Pro-Trp-Met 
(residues 83-85) of oryzacystatin are common to the family 2 
cystatins, whereas these sequences are not found in the family 
1 cystatins. In any event, it is likely that oryzacystatin and 
animal cystatins must have evolved from a cognate ancestral 
gene. 

The present study also provides useful information con- 
cerning phytophysiological significance of oryzacystatin. As 
the Northern hybridization experiment indicates (Fig. 3), the 
oryzacystatin mRNA in rice seeds showed its maximal expres- 
sion at 2 weeks after the flowering of the rice plant and then 
gradually decreased to an undetectable level at 10 weeks 



Fig. 2. Comparison of the amino 
acid sequences of oryzacystatin 
with other cystatins including ki- 
ninogen segments. Identical amino 
acid residues are boxed. The numbering 
starts from the initiation methionine 
residue of oryzacystatin. Gaps have been 
introduced to maximize the sequence ho- 
mology, a, human cystatin A (2); 6, hu- 
man cystatin B (3); c, rat cystatin (5); a\ 
human cystatin C (6); e, bovine colos- 
trum cystatin (8); f, chicken cystatin (9) 
and g, human kininogen segment (resi- 
dues 130-251) (10); h t human kininogen 
segment (residues 252-372) (10). 
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Fig 3 RNA blot analysis. Two /tg each of the total RNA from 
rice seeds harvested at 1 week (lane J), 2 weeks (lane 2), 3 weeks 
(lane 3), 4 weeks (lane 4), or 10 weeks (lane 5) after flowering were 
electrophoresed, blotted onto a nitwcdlulo»en^ 
ized with ^-labeled XOC26 insert (A) or "P-iabeled XOG10 insert 
(B) The positions of 18 S and 28 S ribosomal RNA markers are 
shown (XOC26 and XOG10 inserts are probes for oryzacystatin and 
glutelin, respectively). 
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FIG 4 Southern blot analysis of rice seed genomic DNA. 
Total rice DNA (-6 M g> was cleaved with ScoRI (lanes 2 and S), 
BamUl {lanes 3 and 9), HindUl (lanes 4 <md 10), Pstl (lanes 5 and 
11), and SacI (lanes 6 and 12), and subjected to blot hybridization 
using XOC26 (left) and XOG12 (right) as probes. 

at which time the ripening of the rice seeds is completed. On 
the other hand, the amount of mRNA encoding rice glutelin, 
a major storage protein, reaches a maximum at 3 weeks after 
flowering and then decreases gradually. Thus, the two mRNA 
species apparently show coordinate expression, although the 
oryzacystatin mRNA is synthesized and degraded more dies-, 
tically almost 1 week ahead of the glutelin mRNA. This 
phenomenon may reflect the existence of a system of regulat- 
ing the gene expression in rice seeds, involved in controlling 
the enzymatic proteolysis of the glutelin synthesized during 
the time from flowering to ripening. We have actually found 
in rice seeds a cysteine proteinase that efficiently hydrolyzes 



glutelin and is almost completely inhibited by an equimolar 
concentration of oryzacystatin in vitro (24). 

Knowledge about the structure and mode of expression of 
this rice cysteine proteinase, as well as those of oryzacystatin, 
will define a major system involving the proteolytic regulation 
of rice storage proteins. Gene structural analyses will elucidate 
the mechanisms for coordinate expression of these proteins. 
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Supplement to Molecular cloning of 
cy • t • t na proteinase inhibitor of rice 
(oryzacystatin) 
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Peptide i 
■AHA sequence 



Lye Pro Trp Nat Asp Phe 

1 1* AAA CCD UCC AUG CAU UU 1' 



EX PER I MENTAL PROCEDURES AND RESULTS 

Sa pa rat Ion «n£ analysis o£ peptides trow ftn oryzacystatin 
digest . Oryzacystatin was digested with metal loendopept Ida se in 
SO nM N-ethylmorpnoline buffer, pH 8.5, at an enzyme/ substrate 
ratio of t i 100 (wt/wt) for 16hr at J7«C. Tha paptldaa war* 
fraction* tad on a Vydac 008 column (Sanahu kagaku) by a linaar 
gradient of 0 to 60% acatonltrlU in 0.11 trif luoroacetlc acid at 
a flow rata of l al/min. Fig. 1 -S show* that result. Tha amino 
acid sequences of tha numbered peaks were determined aa shown in 
Table IS. 

Sequences of. tjhj synthetic ol lgodeoxynucleptide probe t uted for 
the aelectlon oj, oryiacystetjn cPNA cionea . The amino acid 
sequence* of peptides corresponding to the nucleotide sequence of 
the nRNA and the sequences of the two probes CA and B> are shown 
in Pig. 2-S. The peptides for probes A and B correspond to 
peaks 6 and 1, respectively (Pig. 1-S). 

Restriction map and sequencing atrate<rv £o_£ oryzacystatin 
cPNA inserts . The restriction map shows only the relevant sites 
for DNA sequencing. The protein-coding region la indicated by a 
filled box in rig. 3-S. The direction and the extent of 
sequencing are also shown by horlxontal arrows. Sequencing was 
done by the dldeoxy method of Sanger (solid lines) and the 
chemical modification method of Haxam-Gi lbert (dashed lines). 



llTMr, lieu) 



Peptide sequence : 



1 i' TTT GCA TCC TAC CTA AA 5* 



Lye Pro Val Asp Ala S«r 



|t7Mr, 1024*ts> 



Pig. 2-S. The sequences of the synthetic 
oligodeoxynucleotlde probata uaed for the 
selection of orysacystatin cONA .clones. 



ice qlutelln cPNA 



and the 



The nucleotide sequence of the 
predicted amino aqid sequence . The complementary 30-ner oligo- 
nucleotide corresponding to the glutei in cOKA sequence (23) at 
nucleotides 11-40 was synthesized. Using this probe, the same 
cONA library of immature rice seeds waa screened under conditions 
Similar to those described In EXPERIMENTAL PROCEDURES, Several 
positive clones were Isolated and the cONA insert of one of them 
( AOG12) w *» subcloned Into pUC18. Pig. 4-S shows the result. 
The partial nucleotide sequence of *OCl2 determined from both 
directions by tha chain-termination method (191 was completely 
Identical to the published nucleotide sequence of glutei in 
(nucleotides U-235 and 953-1 117) (23), indicating that iOGI 2 is 
a cDNA clone for glutei in. 



Table IS. 

Amino acid sequence analyses of fractionated peptides. 

Peak Amino acid sequence) 

94 102 
1 Lys-Pro- Val -Aap-Ala -Sex-Ala -A»o -Ala 

143 137 1S1 84 160 13 162 121 16 



XOC26 



rig. 3-S. Restriction map and sequencing strategy 
for oryzacystatin cDNA inserts. 



Lys-Leu-Tyr-Glu-Ala 
183 US 81 172 165 

78 80 
Lys-Val-Trp- 
271 2S8 71 

29 34 
Xaa-Pbe-Ale-Val-Thx-Clu-Bis- 
99 103 80 4 33 13 



82 87 
Ly a - Pro-Trp-Met - Aap- phe 
160 93 65 95 25 SO 



Tbe amounts of PTB-amlno acids detected (patole) are 
shown below the sequence. The residue numbers of 
amino acids are shown above the sequence. 



TJUUkTCTATACCACAATCC* 



ValAaaiMTyfClaAaaaUlMl^^rSTanwTrpAMlU 

rig. 4-S. The nucleotide sequence of the rice glutei in 
cONA and the predicted amino acid sequence. 




Fig. 1-S. Separation of peptides from an 
oryzacystatin digest. 
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Comparison to the barley amylase/subtilisin inhibitor 
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Abstract. We have cloned and sequenced a 650-nu- 
cleotide cDNA from barley (Hordeum vulgare L.) 
^ aleurone layers encoding a protein that is closely 
w related to a known a-amylase inhibitor from Indi- 
an finger millet (Eleusine coracana Gaertn.), and 
that has homologies to certain plant trypsin inhibi- 
tors. mRNA for this probable amylase/protease 
inhibitor (PAPI) is expressed primarily in aleurone 
, tissue during late development of the grain, as 

• compared to that for the amylase/subtilisin inhibi- 
tor, which is expressed in endosperm during the 
peak of storage-protein synthesis. PAPI mRNA 

, is present at high levels in aleurone tissue of desic- 
1 cated, mature grain, and in incubated aleurone 
layers prepared from rehydrated mature seeds. Its 

# expression in those layers is not affected by either 
r abscisic acid or gibberellic acid, hormones that, 

respectively, increase and decrease the abundance 
of mRNA for the amylase/subtilisin inhibitor. 
PAPI mRNA is almost as abundant in gibberellic 
acid-treated aleurone layers as that for a-amylase, 
9 and PAPI protein is synthesized in that tissue at 
levels that are comparable to a-amylase. PAPI pro- 
tein is secreted from aleurone layers into the incu- 
bation medium. 

Key words: Abscisic acid and mRNA induction 
- Aleurone - Amylase/Protease inhibitor - Gibber- 
•ellm and mRNA induction - Hordeum (aleurone 
Protein). 

' Sn r€ T^ 0nS: ABA " abscisi c acid; ASI = barley amylase/subtil- 
™ inhibitor; bp=nucleotide base pairs; Da = dalton; dpa = 
«y* post anthesis; GA 3 = gibberellic acid; PAPI=probable 

^A^c^fl inhibitor i Poly(A)RNA = polyadenyIated 

'■h-i fi>us * PAGE = sodium dodecyl sulfate-polyacrylamide 
^electrophoresis 

r 



Introduction 

Our laboratories are interested in the mechanisms 
by which specific genes are expressed under devel- 
opmental and hormonal controls in starchy endo- 
sperm and in aleurone tissue of barley (Hordeum 
vulgare L.). An approach to understanding this 
problem can be initiated by identifying and charac- 
terizing the genes for proteins that are expressed 
in abundance in specific tissues at specific develop- 
mental times. In this regard, much is known about 
the synthesis in developing endosperm of prolamin 
storage polypeptides (Mathews and Miflin 1980- 
Giese and Hopp 1984) and of several salt-soluble 
proteins (Jonassen et al. 1981; Ponz etal. 1983* 
Giese and Hejgaard 1984), but relatively little is 
known about the pattern of gene expression and 
of protein synthesis in developing aleurone tissue. 
Being derived developmentally from the endo- 
sperm, any separate function of the aleurone dur- 
ing grain development is not clear, and there are 
no data on a molecular level that clearly identify 
processes separate from those in progress in the 
endosperm. After rehydration of the desiccated 
grain however, the aleurone layer assumes a new 
role; its function is to destroy the now dead tissue 
from which it derived. In response to gibberellins 
produced by the embryo during germination, the 
aleurone layer synthesizes and secretes large quan- 
tities of hydrolases which mobilize the endosperm 
reserves for use by the growing plant. These hor- 
monally regulated events have been studied exten- 
sively (see Jacobsen 1984 for a recent review). 

There has been no evidence that aleurone tissue 
produces hydrolase-inhibitory proteins in any 
abundance. In contrast, many of the abundant 
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proteins in the endosperm of barley and other cere- 
als are inhibitors of a-amylases and of proteases 
(Shivaraj and Pattabiraman 1981; Boisen etal. 
1981; Odani etal. 1983; Weselake etal. 1983; 
Mundy etal. 1983). Recent work has shown that 
some of these proteins are closely related (Odani 
et al. 1983; Campos and Richardson 1983; Shewry 
et al. 1984). The inhibitors of trypsin from barley 
and of a-amylases from wheat share homologies 
with a Afunctional inhibitor of both a-amylase 
and trypsin from finger millet (Eleusine coracana 
Gaertn.) (Campos and Richardson 1983). All of 
them may now be considered members of the bar- 
ley trypsin-inhibitor family (Odani etal. 1983). 
Other Afunctional inhibitors have been found in 
barley and wheat which inhibit endogenous cereal 
a-amylase 2 isoenzymes as well as bacterial subtil- 
isins (Mundy etal. 1983; Weselake etal. 1984). 
The amino-acid sequences of these proteins, called 
ASI for amylase/subtilisin inhibitors, are homolo- 
gous to the sequence of the soybean trypsin inhibi- 
tor (Kunitz) and are therefore classified as 
members of that family (Mundy et al. 1984; Hej- 
gaardetal. 1983). 

The precise role of these inhibitors is not 
known, but the general assumption is that they 
may protect the grain from exogenous hydrolases. 
The ASIs, as inhibitors of the seed's own "germi- 
nation-specific" a-amylase (Weselake etal. 1983; 
Mundy et al. 1983), may also help to regulate the 
activity of this enzyme which is important in mobi- 
lizing the seed's starch reserves during germina- 
tion. In addition, protease inhibitors may contrib- 
ute towards the establishment of a balanced ami- 
no-acid pool in the endosperm storage proteins 
(Jonassen 1980; Hejgaard et al. 1985). 

In this paper, we describe a novel amylase/pro- 
tease inhibitor-type gene, identified by both its 
mRNA and protein, that is the first shown to be 
expressed primarily in aleurone tissue late during 
grain development, and that is expressed there at 
high levels after rehydration of the desiccated seed, 
regardless of the presence or absence of gibberellic 
acid (GA 3 ) or abscisic acid (ABA). The specificity 
of expression of this gene is established by compar- 
ing it to the expression of barley ASI, which is 
shown here to be primarily an endosperm protein 
and which is induced by ABA in mature aleurone 
layers incubated in short-term tissue culture. 

Material and methods 

Tissue preparation. Barley {Hordeum vulgar e L., cv. Himalaya, 
1979 crop) seeds were obtained from the Department of Agron- 
omy, Washington State University, Pullman, USA. Mature 



aleurone layers were prepared from imbided grains and incu- 
bated as described in Rogers and Milliman (1983); for incuba- 
tions with hormones, the buffer contained either 110~ 6 M 
GA 3 or 10" 5 M ABA (both hormones from Sigma Chemical 
Co, St. Louis, Mo., USA). 

Separation of aleurone and starchy endosperm tissues. Develop- 
ing grains were harvested at 20 and 30 days post anthesis (dpa), 
which correspond to the peak and end of storage-protein syn- 
thesis, respectively (Mathews and Miflin 1980; Giese and Hej- 
gaard 1984). At these stages, the developing starchy endosperms 
can be easily squeezed free of the surrounding aleurone-pericarp 
tissue after removal of the embryo. Light and scanning electron 
microscopy (Miindy et al. 1986) showed that these endosperms 
were free of aleurone tissue. The aleurone layers, however, often 
have several layers of endosperm cells attached to them. These 
cells were removed by longitudinally cutting the layers open 
and scrapping them with a scalpel. Following three washes in, 
phosphate-buffered saline (PBS; 0.1 M sodium phosphate, pH 
7.2, 0.015 NaCl), they were essentially free of endosperm cells. 

It was not possible to obtain aleurone and endosperm tis- 
sues by dissection in the dry seed because they are too tightly 
bound to each other. We therefore used mechanical abrasion 
to remove and collect the outer layers of the seed. Mature 
starchy endosperm and aleurone tissues were prepared by pro- 
gressively removing the outer portions of mature, dry grain 
in a rice polisher equipped with a file disc (Pearlest; Kett Elec- 
tric Laboratory, Tokyo, Japan). The aleurone fraction repre- 
sented the most external 3-20% fractional volume (inclusive) 
of the grains removed by abrasion, while the endosperm frac- 
tion represented the innermost 30-100% fractional volume (in- 
clusive). The extent of aleurone cell removal was assessed by 
examining the pearled seeds in an epifluorescence microscope. 
This sensitive, qualitative method takes advantage of the fact 
that aleurone cell walls are specifically rich in ferulic acid which 
fluoresces strongly in ultraviolet light (Jensen et al. 1.982). Endo- 
sperms pearled to 30% were free of aleurone cells except in 
recessed parts of the crease. These areas were removed with 
a scalpel. 

Isolation of and cDNA cloning RNA. Methods for isolation of 
RNA from aleurone layers, cDNA cloning, RNA and DNA 
gel electrophoresis, and DNA sequence analysis were as de- 
scribed in Rogers (1985). A complementary-DNA library con- 
taining 8000 clones were prepared from RNA isolated from 
aleurone layers that had been incubated for 20 h in the presence 
of ABA; it was screened for clones carrying sequences abundant 
in that tissue by colony hybridization with 32 P-labeled cDNA 
prepared from the same RNA source. RNA was isolated from 
other tissues utilizing the same method as for aleurone layers. 

Antibody preparation, a- Amylase and ASI were purified from 
malt (Mundy etal. 1983) and PAPI was purified from barley 
grains according to Svensson et al. (1986). Antibodies towards 
a-amylase and ASI were raised according to standard proce- 
dures, while antibody towards PAPI was prepared by immuni- 
zation after gluteraldehyde polymerization of the antigen (Bol- 
lum 1975). Immunoglobulins were purified from the crude anti- 
sera by chromatography on Protein-A Sepharose 4B (Pharma- 
cia Corporation, Stockholm, Sweden) by standard methods, 
and the protein concentration of the antibody preparations ad- 
justed to 4 mg-ml" 1 . 

Extraction and immune-blotting of proteins. Lyophilized tissues 
were ground in a mortar and pestle with 10 vol. of water, ex- 
tracted by rotating slowly for 3 h at room temperature, and 
then centrifuged at 10000 g for 5 min. Polypeptides in extract 
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aliquots containing equal amounts of protein were separated 
by sodium dodecyl sulfate-polyacrylamide gel electrophoresis 
(SDS-PAGE), and transferred electrophoretically in 0.15 M 
3-amino-2-(hydroxymethyl)-l,3-propanediol-(Tris)-glycine, 
pH 8.9, 20% methanol to nitrocellulose paper (type HAH Y ; 
Millipore Corp., Bedford, Mass., USA). The paper was then 
incubated with lOug-ml" 1 rabbit antibody and assayed with 
peroxidase-labelled swine anti-rabbit immunoglobulins (IgG) 
(Towbin et al. 1979). 

Analysis of polypeptides synthesized in vivo and in vitro. Polya- 
denylated RNA (poly(A)RNA) was translated, and the resul- 
tant protein products were analyzed by immunoadsorption and 
SDS-PAGE as described in Mundy etal. (1985) [^Methio- 
nine and rabbit reticulocyte lysate (N 90) were from Amersham 
(Arlington Heights, 111., USA). 

Proteins in incubated aleurone layers were labelled in vivo 
with [ 35 S]methionine and extracted into buffers containing SDS 
prior to dilution and immunoadsorption as described in Ander- 
son and Blobel (1983) and Mundy et al. (1985). Extract concen- 
trations of 0.25-1 0 6 , 1 -10 6 , 5-10 6 , 10- 10 6 and 20-10 6 cpm per 
600 ul immunoadsorption reaction mixture (specific activity 
110 4 -1.2 10 4 cpm ug" 1 protein) were used to optimize the 
ratio of antigen to antibody. To measure proteins secreted by 
aleurone cells, incubation media were first concentrated by 
freeze-drying, then dissolved in SDS buffer, and processed in 
the same way as the extracts. Because the amount of protein 
in the incubation media was much lower than that in the tissue 
extracts, only media aliquots containing M0 6 cpm 
(4.5 10 4 -13.7-10 4 cpm ug" 1 protein) were used. Phenylmeth- 
yisulfonyl fluoride (100 uM) and ethylenediaminotetraacetic 
acid (EDTA; 5 mM) were included in the immunoadsorption 
buffers to minimize proteolytic degradation. 

Met hods for sequence-homology analysis. The National Biomed- 
ical Research Foundation Protein Sequence Databank carried 
in the Washington University Medical School VAX- 11 com- 
puter was screened for sequences related to the PAPI protein 
with the SEARCH program of Dayhoffet al. (1983). The Ragi 
amylase inhibitor 1-2 sequence (Campos and Richardson 1984) 
was not present in the Databank. Detailed comparisons of pro- 
tein sequences thought to be related to the PAPI sequence were 
made with ALIGN, utilizing the mutation data matrix (250 
PAMs) +6, and a gap penalty of 8 (Dayhoffet al. 1983). 

Results 

Isolation of a cDNA representing a probable amy- 
lase/protease inhibitor. Replica plates of the prima- 
ry transformants were transferred tonitrocellulose 
filters and hybridized with 32 P-labeled cDNA syn- 
thesized from poly(A)RNA isolated from either 
untreated or from ABA-treated aleurone layers in 
a manner similar to that described in Rogers and 
Milliman (1983). Seven separate clones were identi- 
fied that hybridized strongly to both probes, indi- 
cating that they represented a mRNA that was 
abundant in either untreated or ABA-treated aleu- 
rone tissues. Plasmids isolated from all seven 
clones contained cDNA inserts with apparently 
identical sizes and preliminary restriction sites. 
Therefore one of the seven, identified as Alf, was 
arbitrarily selected for more detailed characteriza- 



tion. The cDNA insert was sequenced in its entire- 
ty by the chemical degradation method (Maxam 
and Gilbert 1980); both strands were sequenced 
and all restriction-enzyme sites used for end label- 
ling were confirmed by sequencing (Fig. 1 A). The 
complete nucleotide sequence, and the predicted 
sequence of the protein product are presented in 
Fig. IB. Because of the homology of the predicted 
protein sequence to known amylase and protease 
inhibitors (see below), it is referred to subsequently 
as PAPI, for probable amylase/protease inhibitor. 
The cDNA insert contains 650 base pairs (bp), not 
including the poly(dA) tail. The first ATG is pre- 
ceeded by a 67-bp leader sequence, and is followed 
by an open reading frame that terminates with 
TGA at nucleotide 419. The coding region is 64% 
G + C, a feature shared by both a-amylase cDNAs 
(Rogers and Milliman 1983; Rogers 1985) and the 
thiol protease cDNA (Rogers etal. 1985). At the 
3'-end, this sequence lacks the AATAAA polya- 
denylation signal usually found a short distance 
preceeding the poly(dA) tail, although the se- 
quence AATAAG is present beginning at position 
528, 122 nucleotides preceeding the poly(dA) se- 
quence. This sequence has been reported in a legu- 
min gene near the stop codon in a mRNA that 
also had the canonical AATAAA near the poly(A) 
tail (Croy et al. 1982), but the arrangement seen 
here appears to be one not previously reported. 
Although this cDNA has a long 5' untranslated 
region, we do not know that is, in fact, represents 
the full-length sequence of the mRNA, since 
primer extension and S x nuclease analyses have 
not been performed. Southern blots carrying bar- 
ley genomic DNA digested with restriction en- 
zymes not cutting within the cDNA sequence (such 
as Hindlll and EcoRi) were probed with the la- 
belled cDNA; single hybridizing bands were ob- 
tained, indicating that probably only one gene is 
present (data not presented). Thus we proceeded 
under the premise that our experiments measured 
the expression of at least a very small gene family, 
and most likely a single gene. Detailed analysis 
of multiple genomic clones will be needed to re- 
solve the question of the exact number of homolo- 
gous sequences. 

Svensson et al. (1986) purified a 10-kDa basic 
protein from barley grains and determined the se- 
quence of the first 37 N-terminal and the last two 
C-terminal amino acids. These sequences match 
that presented in Fig. IB exactly, beginning with 
residue 26. As noted by Svensson et al. (1986), the 
predicted protein sequence is closely related to that 
for a known a-amylase inhibitor from Ragi or In- 
dian finger millet (Campos and Richardson 1984); 
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Fig. 1. A Sequencing strategy for the cDNA insert. 
Restriction-enzyme fragments were end-labeled at 
the sites shown and sequence determinations were 
obtained for the regions indicated by the arrows. 
Restriction sites were: X, Xba I (in pUC18 
polylinker); R, Rsa I; B, BssH II; N, Nci I; A, 
Ava I; O, Nco I; and H, HindHI (in pUC18 
polylinker). Fragments were labeled at the 5' (•), 
or the 3' (o) ends. B Nucleotide and derived 
amino-acid sequence of the cDNA insert. The 
insert contains 650 nucleotides excluding the 
poly(dA) tail. The amino-acid sequence is presented 
for the only open reading frame and begins with 
the first ATG initiation codon 



90 100 110 120 130 1*0 150 I60 

KTACTGCTCATGGCCGCCCCCTra 

VLLMAAALVLHLTAAPRAAVALNCGQ 
10 20 30 

170 180 190 200 210 220 230 2«t 

TTGACAGCAAGATGAAACCTTGCCTGACCTACGTTCAGGGCGG^ 

VDSKMKPCLTKVQGGPGPSGECCNGVr 
HO 50 

250 260 270 280 290 300 310 320 

GATCTCC ATA ACCAGGCGCA ATCCTCGGGCGACWXXA A ACCGTTTGC A ACTGCCTGA AGGGGATCCCTCGCGGC ATCC ft 
DLHNQAQSSGDRQTVCNCL.KG I ARC K 
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CAATCTCAACCTCAACAACGCCGCCAGCATCCCXnT£AA 

NLNLKNAASIPSKCNVNVPYTISPDI 
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i|10 120 430 440 450 460 470 H8c 

ACTGCTCCAGGATTTA(nt}AGCGACGATCCGTCAAGCTC 
D C S R I Y * 

490 500 510 520 530 540 550 56c 

CTCTGTCCCTATGTAGTATGGCTACCAGTTATG(XGAGTTTATGCTC^ 

570 580 590 600 610 620 630 6«t 

GATCAGTATCTATGTACGTAGAGTTGAGAGTTTGTACXATGGGCACTCCCAGTGTTC 

i 

650 1 
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the alignment of these sequences is presented in 
Fig. 2 A. It can be seen that the two sequences, 
beginning with PAPI residue 26, have 45 out of 
96 residues identical, with two single residue gaps 
in the PAPI sequence necessary for optimal align- 
ment. The first 25 amino acids, which include a 
hydrophobic core, probably represent a signal pep- 
tide, since they are absent from the mature PAPI 
protein. In addition, the PAPI sequencers homolo- 
gous to known trypsin inhibitors. Presented in 
Fig. 2B are comparisons to two Bowman-Birk 
type inhibitors, one from eggplant (Solarium me- 
longena L.), and one from Macrotyloma axillare 
(family Leguminosae) (Joubert et al. 1979). Bow- 
man-Birk inhibitors are small proteins comprised 
of two repeating units, each unit having a different 
inhibitory specificity (Laskowski and Kato 1980). 
Different regions in the PAPI sequence have ho- 
mology to regions containing the active-site resi- 
dues in the trypsin-inhibitory domains of the two 
proteins. In addition, it can be seen (Fig. 2B, top) 
that apparently both N-terminal and C-terminal 
portions of the PAPI sequence are related to the 
Macrotyloma protein's trypsin-inhibitory domain, 
although in the C-terminal portion, alignment in- 
cludes a lysine residue corresponding to the active 
site. This type of protein does not fit with any 
of the previously established groups of protease 
inhibitors (Laskowski and Kato 1980). 



The availability of this protein, derived from 
the same gene as our cloned cDNA, and antiserum 
to it, presented the opportunity to study the ex- 
pression of this gene at multiple levels. We there- 
fore studied expression of the gene by assessing 
the relative abundance of hybridizable RNA in dif- 
ferent tissues. The presence of active mRNA was 
assessed by translating mRNA from the same tis- 
sues in vitro and immunoprecipitating PAPI. The 
expression of this mRNA in vivo was assessed by 
labeling with [ 35 S]methionine, and by determining 
the relative abundance of PAPI in different tissues 
by immune blotting. 

PAPI RNA is expressed constitutively at high levels 
in incubated aleurone layers, PAPI-cDNA labeled 
by nick translation hybridized strongly to an ap- 
prox. 700-nucleotide RNA in both untreated 
(Fig. 3 A: U) and ABA-treated (Fig. 3 A: A) aleu- 
rone tissue cultures. In contrast, RNA from either 
shoot (Fig. 3 A: S), or root (Fig. 3 A: R) contained 
no detectable hybridizing RNA, even on prolonged 
autoradiographic exposure of the blot (data not 
presented). The specificity of this hybridization 
pattern was demonstrated by hybridizing identical 
blots with probes for a-amylase (Rogers and Milli- 
man 1983) (Fig. 3 A: amylase) or for the GA 3 -in- 
duced thiol protease (Rogers et al. 1985) (Fig. 3 A: 
protease). Consistent with previous results (Rogers 
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A, Comparison between the PAP I sequence and the alpha-aaylase inhibitor 
from Indian Finger Millet (Ragi): 



10 20 30 40 50 

P AP I : HA RAQ VLLMA A AL VLMLTA A PRA A V ALNCCQ V DSKMK PCLTY VQC-CPGPSGECCNG V ROL 

RAGI : AISCGQVSSAIGPCLAYARGAGAAPSASCQ3GVRSL 

10 20 30 

70 80 90 100 110 

HNQAQSSGDRQTVCNC-LKGIARGIHKLNLNNAASIPSKCNVNVPYTISPDIDCSRIY 

NAAARTTADRRAACNCSLKSAASRVSGLNAGKASSIPGRCGVRLPYAISASIDCSRVNN 
40 50 60 70 80 90 

B. Comparison of PAPI to regions from two Bowman- Birk type trypsin inhibitors 
containing the active site residues: 

1. Inhibitor DE-4 from Macrotyloma axillare 

SCORE 

Common K P C C C N R 

PAPI 35-K MKPCLTYVQGGPGPSGEC-CNGV R-58 3.86 (<.0001) 

ft 

DE-4 11-SSKPCCDLCTCTKSIPPQCHCMDM R-35 

# 

PAPI 66-S SGDRQTVCNCLKGIARGIHNLML N-90 3-70 C0001) 

Common S S C C K I H 

2. Eggplant trypsin inhibitor 

SCORE 

Eggplant 29-N PENPKACPRNCDGRIAYGICPL S-52 

PAPI 88-M LNNAASIPSKCMVHVPYTISPDI-111 5.02 (<. 000001) 

Common N N PC YIP 

Fig. 2 A, B. Comparison of the PAPI amini-acid sequence to sequences for (A) an a-amylase inhibitor from ragi (Campos and 
Richardson 1984), and (B) protease inhibitors from Macrotyloma axillare (Joubert et al. 1979) and eggplant (Richardson 1979). 
Computer programs for searching the National Biomedical Foundation Protein Sequence Databank (SEARCH), and for optimal 
alignment and statistical evaluation of similarity (ALIGN) were from Dayhoff et al. (1983) and used as described in Material 
and methods. The protease-inhibitor sequences were selected for analysis because they were among the most likely candidates 
for homologous proteins as identified by SEARCH, The numbers bracketing the sequences in B designate the first and last 
residues, respectively, in the sequence. Asterisks indicate the active-site residues for the known protease inhibitors. Residues shared 
by the two sequences (common) are presented above or below the comparison. The alignment "Score" represents the number 
of standard deviations a particular comparison rates above that from comparison of randomly generated sets of sequences with 
the same amino-acid composition; the probability that a given score would occur on only a chance basis (Dayhoff etal. 1983) 
is presented in parentheses. In A, the alignment score was > 14, with a probability of < 10 ~ 12 that it was the result of chance 



1985; Rogers et al. 1985), the a-amylase probe de- 
tects mRNA that is present in untreated aleurone 
cells and is present at very low levels in shoot and 
root, while the protease probe detects mRNA that 
is present in untreated aleurone tissue but also is 
present in more abundance in shoot and root. The 
abundance of both mRNAs is decreased when the 
aleurone layers are treated with ABA. 

Hybridizable RNA can also be detected with 
the PAPI probe late during development of aleu- 
rone plus endosperm tissue. In Fig. 3 B, in compar- 
ison to RNA from untreated mature aleurone 
layers (lane U), little or no hybridizable RNA is 
present in endosperm-aleurone tissue harvested 
two weeks post anthesis (Fig. 3 B, lane 2), but in- 



creasing quantities are detected at three (lane 3) 
and four (lane 4) weeks post anthesis, such that 
the relative abundance in the four-week post an- 
thesis sample is greater than that detected in the 
samples from incubated aleurone layers. At ap- 
proximately five weeks the grain is desiccated and 
mature. 

In order to estimate the abundance of the PAPI 
mRNA in incubated aleurone layers, an experi- 
ment was performed comparing it to the abun- 
dance of a-amylase mRNA. Although no quantita- 
tive hybridization experiments have been per- 
formed that would allow an estimate of the number 
of amylase transcripts per cell, in-vitro translation 
experiments have demonstrated that these appear 
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Fig. 3 A, B. Distribution of hybridizable PAPI RNA in different 
barley tissues. 10-ug samples of total RNA were subjected to 
electrophoresis through a 1% agarose gel in the presence of 
formaldehyde, transferred to nitrocellulose, and hybridized (see 
Material and methods). A Hybridization to RNA from mature 
excised, incubated aleurone layers and to RNA from shoot 
and root tissue. Three identical blots were prepared and hybrid- 
ized with nick-translated plasmid DNA carrying the a-amylase 
clone E (Rogers and Milliman 1983) insert (left), the PAPI 
cDNA (center), and the aleurain thiol protease cDNA (Rogers 
et al. 1985) (right). The RNA samples were obtained from root 
(R) or shoot (S) tissue from recently germinated plants, or from 
excised, mature aleurone layers incubated for 20 h in the ab- 
sence (U) or presence (A) of 10 \iM ABA. The numbers to the 
side indicate the estimated size in kilobases of the hybridizing 
RNA; these estimates were derived from bacteriophage X Hind- 
III/£coRl fragments subjected to electrophoresis in parallel and 
hybridized separately on a different blot. B Hybridization to 
RNA from developing aleurone + starchy endosperm. Total 
RNA was isolated from the deembryonated grain two, three 
and four weeks post anthesis, and the abundance of hybridiz- 
able PAPI RNA was compared to that in AB A-treated aleurone 
(A) and shoot (5) RNAs as in A 



to be the most abundant mRNAs present (Mozer 
1980; Higgins et al. 1982). The comparison here 
(Fig. 4) utilized hybridization of cDNA probes, 
made either from poly(A)RNA from untreated or 
GA 3 -treated aleurone layers, to dot blots of equal 
quantities of plasmid DNAs carrying the a-amy- 
lase clone E (Fig. 4, lanes E), PAPI (lanes labeled 
A), and thiol-protease (lanes labeled G) inserts. 



The assumption is that all mRNAs present in the 
reverse-transcriptase reaction mixture will be tran- 
scribed with a frequency proportional to their 
abundance, and therefore the amount of radioac- 
tive label incorporated will reflect those propor- 
tions. (This assumption is useful only in a semi- 
quantitative way because there are variables, such 
as G + C content and secondary structure, that 
could alter the efficiency of transcription.) In a 
filter hybridization, where equal amounts of target 
sequences are present, the hybridization signal de- 
tected by direct autoradiography to each of the 
target sequences will reflect their relative abun- 
dances in the probe used. (Another correction 
should take account of the fact that the amylase 
cDNA is twice as large as that for PAPI, and thus 
should have twice the hybridization signal for a 
given molar amount. This is ignored here because 
of uncertainty about the average length of the la- 
beled cDNAs used for hybridization.) Thus, the 
intensity of hybridization to dots containing the 
PAPI plasmid DNA can be compared to that of 
dots containing amylase sequences to determine 
the relative proportions of those two species in the 
cDNA probe, and, within the constraints of the 
problem of reverse-transcription efficiency, can be 
used to estimate their relative abundance in cellular 
mRNA. 

The results of such an experiment are presented 
in Fig. 4A and B. Equal quantities (by weight) of 
amylase (E), PAPI (A), and thiol protease (G) plas- 
mid DNAs were spotted on two identical filters 
and hybridized with cDNA probes from untreated 
(Fig. 4 A), or GA 3 -treated (Fig. 4B) aleurone-cell 
mRNA. It can be seen that PAPI sequences appear 
to be the most abundant of the three in untreated 
cell RNA, while in GA 3 -treated RNA, the abun- 
dance of the amylase sequences appears to be at 
most fourfold greater than those for PAPI. Thus, 
the PAPI transcripts must be one of the most abun- 
dant in these cells. To establish that GA 3 treatment 
of aleurone tissue cultures did not appreciably 
change the abundance of PAPI transcripts, a dot 
blot containing equal quantities of total RNA iso- 
lated from untreated (Fig. 4C: U) cells, or cells 
that had been incubated in the presence of 2.5 |iM 
GA 3 for 12 (Fig. 4C: 12) or 24 (Fig. 4C: 24) hours, 
was hybridized with nick-translated PAPI cDNA. 
The relative amounts of PAPI sequences detected 
in those RNAs were similar. These results, in ag- 
gregate, indicate that PAPI mRNA is constantly 
expressed in aleurone layers isolated from rehy- 
drated mature grains, and its abundance is not al- 
tered to any significant degree by either of the two 
plant hormones, GA 3 or ABA. 
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Fig. 4A-C. Relative abundance of PAPI RNA in mature barley aleurone layers after incubation under different conditions. Panels 
A and B represent dot blots in which plasmid DNA was fixed to nitrocellulose and hybridized with oligo(dT)-primed, 32 P-labeled 
cDNA synthesized from poly(A)RNA isolated from aleurone layers incubated in the absence (A) or presence of 2.5 uM GA 3 
(B) for 18 h. The plasmid DNAs were: E, clone E a-amylase cDNA (Rogers and Milliman 1983); A, PAPI cDNA; G, aleurain 
thiol protease cDNA (Rogers etal. 1985). In each instance, the total amount of DNA applied to each dot was 1 ug; the excess 
over the plasmid DNA amount was made up of carrier salmon-sperm DNA. The amounts of plasmid DNA applied were: 
line 1, 1 ug; 2, 0.25 ug; 3, 0.06 ug; 4, 0.013 ug; and 5, 0.003 ug. In these experiments the standard hybridization buffer also 
included 5 ug-ml" 1 of poly(dA). In C total RNA from different sources was fixed to the filter and hybridized with nick-translated 
PAPI plasmid DNA. The RNA was derived from untreated aleurone layers incubated for 20 h (£/), or from layers that were 
treated with GA 3 for 12 or 24 h. Each dot contained 1 ug of RNA, with the difference above that from aleurone RNA made 
up of carrier yeast tRNA. The amounts of aleurone RNA for each line were; line 1, 1 fig, and the remainder, fourfold dilutions 
in amounts identical to those used for plasmid DNAs in A and B 




Translatable PAPI mRNA in seed tissues and ex- 
cised, mature aleurone layers. We wanted to com- 
pare the relative abundance of PAPI mRNA to 
that for ASI in seed tissues at various developmen- 
tal stages. This comparison would allow us to de- 
termine if the expression of PAPI differed from 
that of other abundant proteins synthesized during 
grain development. This seemed to be an appro- 
priate comparison because ASI is known to be an 
amylase/protease inhibitor, and is known to be 
present in both mature grain (Mundy et al. 1983) 
and excised, mature aleurone layers (Mundy 1984). 
Since no cDNA clone for ASI was available to 
us, hybridization experiments were not possible. 
Therefore the presence of mRNA for the two pro- 
teins in different tissues in developing and mature 
seed was assessed by isolating poly(A)RNA from 
those tissues, translating it in a reticulocyte system 



in the presence of [ 35 S]methionine, and specifically 
immunoprecipitating the two labeled proteins. 

Figure 5 presents autoradiograms of SDS- 
PAGE analysis of proteins synthesized by transla- 
tion of mRNA in vitro. It can be seen that mRNA 
for ASI is readily detected in 20- and 30-dpa 
starchy endosperm (Fig. 5 B, lanes 1 and 2), while 
little or no PAPI mRNA is present (Fig. 5C, 
lanes 1 and 2). In contrast, in endosperm from dry 
seed (lanes 3), no ASI mRNA could be detected, 
but relatively abundant PAPI mRNA was present. 
This demonstrates that ASI mRNA is present in 
endosperm at times corresponding to the peak of 
storage-protein synthesis, while PAPI mRNA ap- 
pears much later. In addition, in the developing 
grain, aleurone tissue shows striking specificity for 
expression of PAPI. No ASI mRNA was detected 
in aleurone tissue from 20 dpa (Fig. 5 B, lane 4), 
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Fig. 5A-C. Immunoprecipitation of PAPI, AS1 and a-amylase from in-vitro translation of mRNA from different tissues of barley 
grains. Presented are fluorograms from SDS-polyacrylamide gels carrying [ 35 S]methionine-labeled translation products (Anderson 
and Blobel 1983). A Total protein products; B immunoadsorptions with antibodies towards ASI (23 kDa) and a-amylase (44 
kDa); C immunoadsorptions with antibodies to PAPI (12 kDa). Tissue samples from which mRNA preparations were obtained: 
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Panel A was from 2-d exposure; panel B was from a 7-d exposure; panel C, lanes 1-6 were from a 28-d exposure, and lanes 
7-9 were from a 32-d exposure 



30 dpa (lane 5), or mature dry grain (lane 6), while 
PAPI mRNA was present beginning at 30 dpa 
(Fig. 5C, lane 5), and was substantially increased 
in dry seed (lane 6). PAPI was also present in the 
endosperm of mature grain (Fig. 5C, lane 3). 

In aleurone layers derived from rehydrated 
grains, the pattern of ASI (and a-amylase) mRNA 
abundance reflects the pattern of in-vivo protein 
synthesis previously described (Mundy 1984). 
(Since ASI and a-amylase are tightly associated, 
antisera to either native protein cross-react with 
the other.) Both ASI and a-amylase mRNA are 
present in untreated aleurone layers (Fig. 5B, 
lane 7). Treatment with ABA increases ASI and 
decreases a-amylase (lane 8), while treatment with 
GA 3 has an opposite effect (lane 9). The results 
observed with respect to translatable a-amylase 
mRNA are similar to those previously reported 
(Mozer 1980; Higgins etal. 1982), and demon- 



strate that the tissues utilized here responded ap- 
propriately to the hormones. In contrast, PAPI is 
expressed at similar levels regardless of the pres- 
ence or absence of hormones (Fig. 5C, lanes 7-9). 

The quantity of PAPI mRNA in aleurone tissue 
cultures, relative to levels in mature seed and rela- 
tive to ASI- and a-amylase-mRNA levels seems 
to be substantially lower in this experiment than 
in either the hybridization experiments (see above) 
or the in-vivo experiments (below). When similar 
experiments were performed utilizing wheat-germ 
extract instead of reticulocyte lysate to translate 
mRNAs, the abundance of PAPI mRNA relative 
to ASI and a-amylase was apparently even less 
than determined here (data not presented). Thus, 
the discrepancy in estimates of PAPI mRNA abun- 
dance as determined by hybridization and in-vivo 
labeling versus in-vitro translation of total mRNAs 
is not dependent upon the translation system. To 
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Fig. 6 A-C. Immunoprecipitations of PAPI, ASI and a-amylase from excised aleurone layers incubated in vivo with [ 35 S]methionine. 
Sets of 115 aleurone layers from de-embryonated half-seeds were incubated in 22 ml buffer containing 3.7* 10 7 Bq [ 35 S]methionine 
(4.3-1 0 1 3 Bq -minor 1 ) for 20 h in the absence, or presence of GA 3 or ABA. Presented are fluorograms of SDS-polyacrylamide 
gels containing labeled proteins selected by A, preimmune IgG; B ASI and a-amylase anti-IgG; C PAPI anti-IgG: 
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The fluorogram from panel B was exposed 7 d, while that from C was exposed for 28 d. The relative abundance of the various 
labeled proteins was estimated by scanning densitometry of the bands on the fluorograms, and the relative amounts in the 
different immunoprecipitates was calculated by dividing the scan peak area by the (days exposure) x (number of methionine 
residues in the mature protein) x ([ 35 S]methionine decay factor): 





Media 






Extract 






Un 


ABA 


GA 


Un 


ABA 


GA 


a-Amylase 


0.04 


0.0009 


0.15 


0.01 


0.07 


0.4 


ASI 


0.0001 


0.02 


0.0003 


0.004 


0.2 


0.002 . 


PAPI 


0,08 


0.07 


0.08 


0.05 


0.05 


0.05 



explain this discrepancy, we speculate that the 
PAPI mRNA may be translated with less efficiency 
in-vitro in the presence of other mRNAs present 
in abundance in incubated aleurone tissue. 

Synthesis of PAPI protein in aleurone layers incu- 
bated in vivo. The presence of relatively large 



amounts of mRNA for PAPI in mature aleurone 
layers indicated that PAPI protein would be one 
of the most abundant proteins synthesized in those 
tissues after rehydration of the desiccated grain. 
However, previous studies by others utilizing SDS- 
PAGE analysis of labeled by incubation of aleu- 
rone layers in the presence of radioactive amino 
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Fig. 7A-C. Immunoblotting assay of ASI and PAPI at different times during development of barley grains. Proteins separated 
by SDS-PAGE were transferred to nitrocellulose and detected by immunoperoxidase. Panel A presents a silver stain of the 
total proteins present in each sample (400 ug/lane). Panel B presents detection with anti-ASI IgG, and panel C, with anti-PAPI. 
Lanec represents 1 ug of purified protein standard for each. The proteins were extracted from grains 14 (lane 7), 18 (2), 26 
(J), 30 (4), 35 (5), and 44 (lane 6) dpa 



acids (Mozer 1980; Higgins et al. 1982) had not 
provided evidence for this. We therefore incubated 
excised, mature aleurone layers in the presence of 
[ 35 S]methionine with, or without, GA 3 or ABA, 
and the presence of newly synthesized, labeled 
PAPI was assessed by immunoprecipitation and 
SDS-PAGE autoradiography. These experiments 
were complicated by two factors: (1) mature PAPI 
proteins contain only a single methionine residue, 
and (2) relatively large amounts of PAPI protein 
are contained in mature aleurone layers (see be- 
low). Thus, newly synthesized protein is diluted 
out by already existing protein during immunopre- 
cipitation. In order to circumvent the latter prob- 
lem, we tried varying concentrations of tissue ex- 
tract (antigen) in immunoadsorption experiments. 
Extract concentrations of 0.25-10 6 , M0 6 , 5 10 6 , 
10 - 10 6 and 20 * 10 6 cpm per 600 |il reaction volume 
(specific activity 1 • 10 4 -1 .2 * 10 4 cpm * |ig" 1 protein) 
were used to optimize the ratio of antigen to anti- 
body. These results demonstrated that the intensity 
of the immunoprecipitated PAPI band, relative to 
other, nonspecific bands, was inversely correlated 
with the quantity of extract added to the standard 
amount of antiserum (data not shown). Thus it 
is likely that even the smallest amount of extract 
still represented a condition of antigen excess. For 
clarity in the figures, only selected sets of results 



are presented. In addition, to optimize the amount 
of protein available for antigen binding in the ex- 
tracts, we included SDS in our extraction and me- 
dia resolubilization buffers. Although this denatur- 
ing treatment aids solubilization and antigen re- 
cognition of certain proteins in freeze-dried sam- 
ples, it sometimes produces the fairly high levels 
of nonspecific adsorption of sample polypeptides 
to antibodies and immunoadsorbent seen here 
(Fig. 6 A, lanes 1-3). However, comparison of 
these lanes to lanes containing proteins adsorbed 
by specific antiserum clearly shows that a protein 
of 10 kDa (PAPI) is specifically recognized. 

Again, consistent with previous in-vivo experi- 
ments (Mundy etal. 1984), and with the results 
obtained with mRNA translation in vitro (above), 
ABA causes a decrease in a-amylase and an in- 
crease in ASI in tissue extract (Fig. 6 B, lane 5), 
while GA 3 has the opposite effect (lane 6). Gibber- 
ellic acid also causes the appearance of labeled a- 
amylase in the culture medium (lane 3). In con- 
trast, PAPI is found in the culture medium in levels 
that are little influenced by the hormones (Fig. 6C, 
lanes 1-3). Similar results are seen in tissue-extract 
samples (Fig. 6C, lanes 4-6). When the amount of 
tissue-extract sample used for immunoadsorption 
was increased fivefold (Fig. 6C, lane 7), the 
amount of specifically adsorbed PAPI (relative to 
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the background pattern of nonspecifically ad- 
sorbed proteins) decreased substantially. Increases 
of 20-fold in quantity of tissue extract over that 
used in lanes 4-6 yielded patterns that were not 
different from those obtained with preimmune se- 
rum (data not presented). 

We utilized these data to estimate the relative 
quantities of the three proteins synthesized under 
the different conditions (Fig. 6, legend). These cal- 
culations ignore the effect of unlabeled protein on 
efficiency of immunoprecipitation, and thus will 
substantially underestimate the amount of PAPL 
In addition, the relative values cannot be compared 
between one set and another (for example, between 
the control and GA 3 -treated sets) because the ami- 
no-acid pool sizes may differ substantially. Never- 
theless, these calculations demonstrate that PAPI 
is the most abundant of the three proteins in the 
media samples from control and ABA-treated 
aleurone, and amylase is apparently only twice as 
abundant in incubation medium from GA 3 -treated 
tissue. A similar pattern is obtained with the tissue 
extracts, except that ASI is the most abundant pro- 
tein of the three in ABA-treated tissue. It is impor- 
tant to note that the relative ratios of newly synthe- 
sized PAPI to a-amylase in this experiment in con- 
trol and GA 3 -treated aleurone layers are very simi- 
lar to the relative ratios of hybridizable RNA mea- 
sured in Fig. 4. This concordance of results from 
two independent experiments supports our asser- 
tion that the apparently low levels of translatable 
PAPI mRNA observed in Fig. 5 are likely to be 
an artefact. 

Steady-state levels of PAPI and ASI protein as- 
sessed by immune blotting. Immune blotting was 
used to assess the relative amounts of ASI and 
PAPI proteins in developing whole barley grains 
(Fig. 7). It can be seen that the 20-kDa ASI protein 
is already detectable at 14dpa (Fig. 7B, lane 1), 
and that its steady-state abundance progressively 
increases to the level detected in the 44-dpa mature 
sample (Fig. 7B, lane 6). In contrast, the 10-kDa 
mature PAPI protein is first detected in this experi- 
ment at 35 dpa (Fig. 7C, lane 5), and increases to 
the level present at 44 dpa (lane 6). The larger band 
detected in the PAPI control sample (Fig. 7C, 
lane C) may represent a small amount of prepro- 
tein present in the purified sample; a similar minor 
band of approx. 14 kDa was observed in some tis- 
sue-protein analyses (data not presented). In other, 
similar experiments (data not presented), the rela- 
tive abundance of PAPI in aleurone layers and en- 
dosperm from rehydrated mature grains was simi- 
lar to that detected in extracts from mature grain. 



Discussion 

We have described a cloned cDNA representing 
a barley protein (PAPI) with homologies to a 
known amylase inhibitor, and to trypsin-inhibitory 
domains of Bowman-Birk type Afunctional pro- 
tease inhibitors. However, to date, the PAPI pro- 
tein has been shown not to inhibit the action of 
trypsin, chymotrypsin, subtilisin, crude Aspergillus 
niger protease, and papain on synthetic and pro- 
tein substrates, nor has it inhibited a-amylases 
from human saliva, hog pancreas, Tenebrio moli- 
tor f barley malt, Aspergillus niger or Bacillus sub- 
tilis on soluble starch (Svensson et al. 1986). The 
determination of the target enzyme(s), if any, for 
PAPI, will take both time and perhaps luck, con- 
sidering the high specificity of protease and-or 
amylase inhibitor interactions (Laskowski and 
Kato 1980; Campos and Richardson 1983; Mundy 
et al. 1983; Odani et al. 1983). 

The importance of the protein and cloned 
cDNA presented here lies more in the tissue speci- 
ficity and abundance of its expression. We have 
demonstrated that, in contrast to ASI which is ex- 
pressed primarily, if not exclusively, in endosperm 
tissue temporally corresponding to the peak of 
storage-protein synthesis (Mathews and Miflin 
1980; Giese and Hejgaard 1984; Mundy et al. 
1986), PAPI is expressed late during grain develop- 
ment. Its abundant expression In the aleurone cells 
of mature seeds, a layer which comprises only a 
small fraction of the grain, and its abundant ex- 
pression in aleurone layers isolated after rehydra- 
tion of desiccated grains lead us to speculate that 
it may be targeted there to inhibit exogenous amy- 
lase and-or protease activity. 

In contrast to the effects of ABA and GA 3 
on the synthesis of a-amylase and ASI in excised 
barley aleurone layers, the synthesis of PAPI, mea- 
sured in vitro (Fig. 5), and in vivo (Fig. 6), is not 
influenced by incubation with either hormone. Be- 
cause the effects of these hormones on overall pro- 
tein synthesis in aleurone layers is profound (Ho 
and Varner 1976; Mozer 1980), the apparent con- 
stitutive synthesis of PAPI in these cells indicates 
that the protein plays a role in maintenance of 
tissue integrity. A similar pattern of synthesis has 
been shown for the barley chymotrypsin-inhibitor 
1 protein (Mundy et al. 1986), a result which lends 
support to our speculation that germinating aleu- 
rone cells constitutively synthesize proteinase in- 
hibitors as some sort of protective response. Inter- 
estingly, at least some of the de-novo synthesized 
PAPI can be identified in the media of cultured 
layers (Fig. 6), indicating that the protein is se- 
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creted by aleurone cells. That PAPI is secreted is 
consistent with the presence of a signal peptide 
as determined from the cDNA sequence. 

Experiments quantitating the relative abun- 
dance of hybridizable PAPI RNA (Fig. 4), and the 
quantity of PAPI protein synthesized in vivo 
(Fig. 6), both indicate that PAPI mRNA and pro- 
tein are expressed at levels comparable to a-amy- 
lase in GA 3 -treated aleurone tissue. Experiments 
assessing PAPI mRNA levels, as judged by ability 
to produce PAPI protein in a cell-free protein-syn- 
thesis system (Fig. 5), are however at variance with 
these results. We choose to favor the agreement 
displayed by two different measurements, and sug- 
gest that the latter results may be the consequence 
of some selective disadvantage PAPI mRNA has 
in cell-free translation systems relative to other 
aleurone mRNAs. This remains to be proved. 
However, this observation, coupled with the pau- 
city of methionine residues in the protein, might 
explain why other studies utilizing cell-free transla- 
tion of RNA from excised aleurone layers (Mozer 
1980; Higgins et al. 1982) also did not identify a 
prominant 12-kDa protein product. 

Both hybridization (Fig. 3B) and translation- 
immunoprecipitation (Fig. 5C) experiments indi- 
cate that PAPI mRNA must be one of the most 
abundant mRNAs late during aleurone cell devel- 
opment and in mature, dry seeds. As demonstrated 
by the disappearance of ASI mRNA in mature 
endosperm (Fig. 5B), most mRNAs specifying 
storage and other proteins synthesized during the 
peak of endosperm development are lost as the 
seed matures (Mathews and Miflin 1980; Giese 
and Hejgaard 1984; Mundy etal. 1986). This 
abundance of PAPI mRNA in the dry seed raises 
the question of whether the PAPI mRNA present 
in aleurone layers after rehydration is actively syn- 
thesized or whether it represents stored mRNA 
synthesized prior to desiccation. This question can 
be best addressed by runoff transcription experi- 
ments utilizing barley-aleurone protoplast nuclei 
(Jacobsen and Beach 1985). The PAPI sequence 
shares one feature with genes that are expressed 
in aleurone cells after rehydration but not in devel- 
oping seed, namely the two oc-amylase (Rogers 
1985) and the aleurain thiol protease (Rogers et al. 
1985) genes, and that is a high (64%) G + C con- 
tent. In this regard it differs substantially from two 
genes that are expressed in endosperm during the 
peak of storage-protein synthesis; hordein (Ras- 
mussen et al. 1983) storage-protein and protein-Z 
(Hejgaard et al. 1985) genes both have about 50% 
G + C in their coding sequences. Whether the high 
G + C content of PAPI and the GA 3 -induced aleu- 



rone genes, and the complex secondary structures 
that result (Rogers 1985; Rogers et al. 1985) have 
any role in mRNA stability is as yet unclear. 

Our results also provide additional data regard- 
ing the expression of ASI. This protein is presum- 
ably synthesized in endosperm cells to protect the 
seed proteins against microbial attack and to limit 
the degradation of starch granules by the plant's 
own a-amylase. This enzyme activity must be care- 
fully regulated in cereals of temperate zones which 
are often stimulated to germinate precociously by 
wet weather. By inhibiting low levels of amylase 
activity secreted into the endosperm, ASI may act 
as one of many factors which mediate the change 
from dormancy to germination in these seeds. In- 
terestingly, ASI is not synthesized in developing 
aleurone tissue (Fig. 5), but its synthesis can be 
enhanced in mature, excised aleurone layers by 
ABA and reduced by GA 3 (Fig. 5). Such a pattern 
of expression and hormonal control in mature 
aleurone layers is not seen for several other seed 
proteins tested to date (Mundy etal. 1986). This 
indicates that the ABA induction of ASI synthesis 
and of a limited number of other polypeptides is 
specific and may be related to their physiological 
functions. Although the general mechanism of 
ABA control of protein synthesis in cultured layers 
is not known (Ho and Varner 1976), its effect on 
ASI probably occurs at the level of mRNA abun- 
dance. Abscisic acid has been shown to prevent, 
at least in part, the increase in levels of mRNAs 
encoding a-amylase in GA 3 -treated mature aleu- 
rone layers (Mozer 1980; Higgins et al. 1982). In 
addition, type A or, more properly, a-amylase 1 
(Svensson et al. 1985) mRNA is expressed at sub- 
stantial levels in untreated mature aleurone layers 
(Rogers 1985), and ABA treatment causes a sub- 
stantial decrease in this base-line expression 
(Fig. 3). Taken together, these results support the 
idea that ASI synthesized in aleurones represents 
an additional co- or post-translational control of 
endogenous a-amylase activity that is promoted 
by the same hormone that itself causes a decrease 
in a-amylase mRNA 

This work was supported in part by grant 83-CRCR-1-1332 
from the U.S. Department of Agriculture. 
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Summary. cDNA and genomic DNA clones for a soybean 
(Glycine max) seed lipoxygenase-3 (lox-3) isozyme have 
been isolated and characterized by molecular cloning and 
sequence analysis. The cDNA clone p3H9 was identified 
as encoding a lipoxygenase isozyme based on hybridization 
selection and immunoprecipitation assays. The insert from 
p3H9 was used to screen 700,000 recombinants in a phage 
genomic library. Two overlapping genomic clones were iso- 
lated. After localizing the mRNA coding region to the mid- 
portion of one of these clones, nucleotide sequence analysis 
was performed. The structure of the gene was deduced fol- 
lowing the additional cloning and sequencing of nearly full 
length cDNAs. Primer extension and SI protection experi- 
ments were used to define the 5' boundary of the gene. 
The gene is encoded by nine exons interrupted by eight 
introns. All exon-intron junctions conform to the GT-AG 
rule. The deduced amino acid sequence consists of 857 ami- 
no acids with a molecular weight of 96,663 daltons. The 
5' untranslated segment contains approximately 46 nucleo- 
tides before the first methionine (ATG) codon. A typical 
TATA box occurs some 30 nucleotides upstream of the 
message start site while a CAAG sequence occurs in place 
of a CAAT box some 80 nucleotides upstream of the start 
of transcription. Four polyadenylation signals are present 
in the 3' untranslated region. At least two of these are used, 
giving rise to cDNA clones bearing 3' untranslated seg- 
ments of 148 and 196 nucleotides. To identify which 
member of the lox gene family had been isolated, a gene- 
specific probe from the 3' untranslated region of p3H9 was 
hybridized to dot blots of soybean embryo RNAs derived 
from lox-1, -2, or -3 null mutants. No hybridization to 
RNA from the lox-3 null mutant was seen, while the re- 
maining mutant RNAs showed no reduction in hybridiza- 
tion. Further confirmation that the gene encodes a lox-3 
isozyme was obtained by a comparison of deduced and 
determined amino acid sequence data. An analysis of the 
lox-3 sequence and a comparison to other available soybean 
lox sequences is also presented. 

Key words: cDNA and genomic clones - Glycine max - 
Lipoxygenase - Soybean seed protein 
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Introduction 

Lipoxygenases (linoleate : oxygen oxidoreductase, EC 
1.13.11.12) are a widely distributed group of nonheme iron- 
containing enzymes that catalyze the oxygenation of po- 
lyunsaturated fatty acids or their esters containing a cis, 
ds-l,4-pentadiene system to form monohydro peroxides as 
primary products. At least three lipoxygenases have been 
identified in mammalian systems and they are the focus 
of intensive interest due to their role in the biosynthesis 
of prostaglandins, leukotrienes and lipoxins (for reviews, 
see Samuelsson 1983; Needleman et al. 1986). On the other 
hand, the physiological role of plant lipoxygenases (lox) 
is poorly understood. The enzyme is thought to play a role 
in fruit ripening and abscission (Veldink et al. 1977), senes- 
cence (Leshem 1984), and resistance to plant pathogens 
(Lupu etal. 1980; Hildebrand et al. 1986; Ocampo et al. 
1986). Lox has been the focus of strong commercial interest 
because its activity in a variety of foods can result in the 
generation of objectionable as well as pleasant flavors aris- 
ing from aromatic aldehydes and alcohols generated 
secondarily from lox-produced hydroperoxides (for reviews, 
see Axelrod 1974; Eskin etal. 1977; Galliard and Chan 
1980). From a commercial viewpoint the negative and posi- 
tive aspects of lox activity in a major crop such as soybean 
are of particular interest. On the negative side, the rancidity 
of soy oil and the poor quality of soy protein are thought 
to arise as a consequence of lox activity (Mustakas et al. 
1969; Wolf 1975; Hildebrand and Kito 1984). On the posi- 
tive side, defatted soy extract is an important component 
in bread-making. Lox, which is the active ingredient in the 
extract, is responsible for two processes: (i) the bleaching 
of carotenoid pigments, resulting in a whiter bread, and 
(ii) reactions involving oxidation and subsequent crosslink- 
ing of wheat glutens, resulting in enhanced texture (for re- 
view, see Eskin et al. 1977). 

Four lox isozymes have been identified in soybean seeds 
(Axelrod etal. 1981), and as a group they constitute 
l%-2% of the total seed protein (Vernooy-Gerritsen et al. 
1983). The enzyme consists of a single polypeptide chain 
of roughly 95,000 daltons. The isozymes have been divided 
into 2 classes or types that differ in substrate preference, 
heat stability, pH optimum, and response to calcium. The 
enzyme crystallized by Theorell et al. (1947) is a type-1 en- 
zyme and has been designated lox-1. It is heat stable, has 



a pH optimum of about 9.0, and prefers anionic substrates 
(fatty acids). The type-2 enzymes, lox-2 (Christopher et al. 
1970) and lox-3 (Christopher et al. 1972), are more easily 
inactivated by heat, have pH optima closer to neutrality 
and prefer esterified substrates. Lox-2 activity is stimulated 
by calcium, while lox-3 is inhibited. Lox-3 can be chromato- 
graphically resolved into two fractions, designated 3A and 
3B, whose properties are virtually indistinguishable. Addi- 
tional differences also exist between the isozymes. Lox-3 
generates singlet oxygen with higher efficiency than lox-1 
or lox-2 (Kanofsky and Axelrod 1986). The enzymes also 
differ in the proportions of 9- and 13-hydroperoxides 
formed. Lox-1 shows a preference for the 13 position, while 
lox-2 and lox-3 utilize the 9 or 13 positions in roughly 
equal proportions (Christopher and Axelrod 1971). 

The involvement of lox in determining the quality of 
soybean products makes it an interesting candidate for 
study. As an approach to understanding how lox is regu- 
lated, we have undertaken the molecular cloning and analy- 
sis of a soybean lipoxygenase gene. Recently, the complete 
amino acid sequence of soybean lox-1 was deduced from 
its corresponding cDNA (Shibata et al. 1987). A partial 
sequence for a soy "lox-1 -like" protein has also been re- 
ported (Start et al. 1986). Here, we present the first com- 
plete nucleotide sequence and structure of a soy lox-3 gene. 

Materials and methods 

Isolation of size-fractionated poly (A)* RNA. Polysomes 
from mid-maturation stage soybean embryos (Glycine max; 
variety Dare) were isolated as described by Goldberg et al. 
(1981). After pelleting through a cushion of CsCl, the poly- 
somal RNA was enriched for poly(A) + mRNA by adsorp- 
tion to oligo (dT) cellulose. The eluted material was sub- 
jected to zone centrifugation at 15° C in the Spinco SW41 
rotor through a linear 5%-30% sucrose gradient in 20 mM 
Tris (pH 7.6), 10 mM NaCl and 0.1% SDS. Centrifugation 
was for 11 h at 36,000 rpm. Fractions were collected by 
precipitation in ethanol, then resuspended in H 2 0. Aliquots 
of each fraction were translated in a reticulocyte in vitro 
translation system as described by the manufacturer (Pro- 
mega) and the translation products separated by one-di- 
mensional denaturing polyacrylamide gel electrophoresis. 
Fractions showing substantial enrichment of poly(A) + 
mRNA encoding for proteins greater than 90,000 daltons 
were pooled and used for cloning. 

Construction and screening of soybean embryo cDNA librar- 
ies. Construction of the initial cDNA library from size- 
fractionated poly(A) + RNA followed procedures described 
previously (Yenofsky et al. 1982). The library was screened 
according to the procedure of Grunstein and Hogness 
(1975) with a cDNA probe made from the same RNA that 
was used to construct the library. Replicas of the library 
were hybridized with the cDNA probe for two different 
time periods (3 and 17 h) in order to be able to distinguish 
between clones derived from abundant and less abundant 
messages in the labeled cDNA population. Clones exhibit- 
ing rapid. hybridization with the labeled probe were selected 
for further analysis. 

To obtain full- or nearly full-length cDNA clones, a 
second library was made from total polysomal poly(A) + 
RNA according to Gubler and Hoffman (1983), except that 
the double-stranded cDNA was first size-fractionated to 



enrich forcDNAs greater than 1.500 base pairs before join- 
ing it with the vector pBR322. This library was screened 
with the insert isolated from p3H9, a partial-length lox 
clone obtained from the first library. Approximately 2,000 
transformants were screened, and 16 reacted with the insert. 
Restriction analysis indicated that 13 were homologous to 
p3H9. Of these, 4 were close to full-length. 

Hybrid selection, immunoprecipitation and RNA blot proce- 
dures. RNA denaturation, fractionation and transfer to ni- 
trocellulose, as well as hybridization selection, were carried 
out as previously described (Yenofsky et al. 1982) except 
that the incubation temperature for hybrid selection was 
50° C. Immunoprecipitation of in vitro translation products 
with Staphylococcus aureus Cowan 1 strain cells was per- 
formed as described by the manufacturer (The Enzyme 
Center, Boston, Mass.). RNA was dotted onto Zeta-Probe 
membranes following the manufacturer's protocol (Bio- 
Rad). 

Isolation of lipoxygenase genomic clones. A " Haelll-Alul 
partial " soybean genomic library, provided by R. Gold- 
berg, was screened by plaque hybridization with the isolated 
lox cDNA insert from clone p3H9, using standard condi- 
tions (Maniatis et al. 1982). 

DNA sequencing. Labeled genomic DNA fragments were 
sequenced by the method of Maxam and Gilbert (1980). 
Over 90% of the genomic sequence was determined from 
both strands. cDNA clones were sequenced using both 
chemical degradation and dideoxy chain termination meth- 
ods (Sanger et al. 1977). When double-stranded plasmid 
DNA derived from minilysates was used as the source of 
material for dideoxy sequencing, we encountered severe 
background problems. To overcome this, minilysates were 
pretreated with Bal 31 nuclease for 10 min at 30° C, under 
conditions described by the manufacturer (New England 
Biolabs). The reaction was terminated by addition of 
EGTA to a final concentration of 20 mM and the DNA 
was phenol extracted and ethanol precipitated. The Bal 31 
does not affect closed circular DNA, while degrading other 
nucleic acids in the minilysate. 

Primer extension and SI protection. For the primer exten- 
sion experiment, a 106-nucleotide-long Fspl-Haelll geno- 
mic fragment (nucleotides 920-1025, Fig. 5) was kinase la- 
beled at the Haelll site, mixed with 25 fig of total polysomal 
RNA or yeast tRNA in 25 ul of 40 mM PIPES (pH 6.4), 
400 mM NaCl, 80% formamide, 1 mM EDTA, denatured 
and allowed to anneal for 3 h at 42° C. The reaction mix 
was diluted with an equal volume of ice-cold 10 mM Tris 
(pH 8.0), 1 mM EDTA, collected by ethanol precipitation 
and resuspended in 50 ul of 50 mM Tris (pH 8.3), 50 mM 
KC1, 1 mM dithiothreitol (DTT), 10 mM MgCl 2 , 2 mM 
of each of the four dNTPs, and 100 units of reverse tran- 
scriptase. The reaction was incubated at 42° C for 100 min 
before being terminated by ethanol precipitation, after 
which the products were resuspended in loading buffer and 
analyzed by gel electrophoresis. 

For SI protection studies, a 346-nucleotide-long Ahalll- 
Fspl fragment (nucleotides 579-919, Fig. 5) was kinase la- 
beled at the Fsp\ site, mixed with total polysomal RNA 
or yeast tRNAs, denatured and allowed to anneal as de- 
scribed above. The reactions were terminated by diluting 



into 300 ul 0.28 M NaCl, 50 mM sodium acetate (pH 4.5), 
4mM ZnSo 4 . Twenty-five hundred units of SI nuclease 
were added and the reaction was incubated at 37° C for 
30 min, at which time an additional 500 units of S1 were 
added for another 20 min. The reaction mix was phenol 
extracted prior to collection by ethanol precipitation, and 
the products were analyzed as above. 

Results and discussion 

Isolation of cDNA clones 

Poly(A) + RNA was obtained from mid-maturation stage 
soybean embryos and then size-fractionated to enrich for 
lipoxygenase mRNA. A cDNA library was constructed 
with this enriched RNA and subsequently screened with 
a cDNA probe made from the same RNA. Clones were 
chosen based on their ability to hybridize rapidly with the 
enriched probe. Positively reacting colonies whose plasmids 
hybridized to mRNAs of appropriate size (greater than 
2.8 kb) on Northern blots were chosen for hybrid selection 
experiments. Three clones were isolated by this screening 
strategy. The clone with the largest insert (approximately 
580 bp), designated p3H9, hybridized to an mRNA slightly 
less than 3,000 nucleotides in length (Fig. 1, lane 1) and 
was able to hybrid select an mRNA encoding a protein 
which comigrated with purified lox (Fig. 2, lane 3). 




Fig. 1. Electrophoretic behavior of mRNA species homologous to 
recombinant plasmid p3H9. RNA from soybean embryos was elec- 
trophoresed under denaturing conditions through an agarose gel 
and subjected to Northern blot analysis in the presence of nick- 
translated p3H9. Lane 1, 5 ng soybean total polysomal RNA; lane 
2, 3 ng pBR322 molecular weight markers. The numbers refer to 
the size of the markers, in kilobases 

Fig. 2. Translation products derived from hybrid selected mRNA. 
Soybean embryo RNA was translated in a reticulocyte lysate in 
vitro translation system in the presence of 35 S-methionine and the 
products were electrophoretically separated in denaturing poly- 
acrylamide gels followed by fluorography. Lane 1 , no added RNA ; 
lane 2, total polysomal RNA; lane 3, RNA hybrid selected by 
cDNA clone p3H9; lane 4, RNA hybrid selected by pBR322 con- 
trol. The arrow indicates the migration position of a purified soy- 
bean lipoxygenase marker 
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Fig. 3. Characterization of affinity-purified lipoxygenase antibody 
and immunoprecipitation of hybrid selection products. Reticulo- 
cyte lysates were programmed with either total polysomal RNA 
(lanes 1 through 5) or RNA obtained through hybrid selection 
(lanes 6-8). Lanes 1 and 2, as in Fig. 2. Translation products in 
lanes 3-8 were immunoprccipitated by affinity purified lox anti- 
body alone (lanes 3 and 6) or in the presence of excess unlabeled 
lox (lanes 4 and 7), or urease (lanes 5 and 8). The position of 
molecular size markers is indicated on the side, in kilodaltons 



To further characterize the clone, we obtained an affini- 
ty-purified lox antibody from T. Kaye Peterman. This anti- 
body was able to specifically bind the lox synthesized in 
a reticulocyte in vitro translation system programmed with 
total soybean embryo mRNA (Fig. 3, lane 3). Further, the 
binding of labeled lox by the antibody could be competed 
out by incubating the lysate in the presence of excess unla- 
beled lox (Fig. 3, lane 4), but not by urease (Fig. 3, lane 5), 
a protein found in soybean embryos, the size of which is 
similar to lox. When hybrid selected RNA was translated 
in vitro and subjected to the same immunoprecipitation 
experiment, identical results were obtained (Fig. 3, lanes 
6-8). On the basis of these results, we concluded that clone 
p3H9 encodes a lox isozyme. 

In order to determine which lox cDNA had been iso- 
lated, total RNAs from mid-maturation stage soybean em- 
bryos of mutants lacking either lox-1, -2, or -3 were probed 
with p3H9/3\ a subclone of p3H9 that contains only the 
gene-specific 3' untranslated region of the cDNA (Fig. 4). 
No hybridization was observed with the RNA from the 
mutant lacking lox-3, while hybridization to the RNAs 
from the mutants lacking lox-1 or lox-2 was equal to that 
of the RNA from the wild type, suggesting that the gene 
encodes a lox-3 isozyme. 

Isolation of genomic clones 

Genomic clones were isolated from a " Hae\l\-Alul partial" 
soybean genomic library by screening with the nick-trans- 
lated p3H9 insert. Two positive clones, designated 3H9.7.1. 
and 3H9.19.1, were identified after screening 700,000 re- 
combinants. Restriction analysis (Fig. 5) indicated that the 
two clones contain overlapping regions of the same gene. 
The sizes of the two cloned genomic fragments were roughly 
15 and 12 kb, respectively, for 3H9.7.1 and 3H9.19.1. 

As seen in Fig. 5, EcoKl digestion of these clones gener- 
ated a number of fragments within the insert. The location 
of the mRNA coding region was determined by hybridizing 
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Fig. 4. Dot blots of RNAs from soybean lipoxygenase-null soybean 
mutants. Total polysomal RNAs from the indicated soybean lox 
phenotypes were dotted in duplicate (5 |ig per dot) onto Zetaprobe 
membrane. tRNA was applied in the same amount as a control 
for nonspecific binding. Hybridization was with p3H9/3', a gene- 
specific subclone of p3H9 containing 172 bp of 3' untranslated 
sequence (from positions 4975 to 5147 in the genomic sequence) 
in pGEM3. The plasmid was nick-translated and hybridized to 
the dot blot as described in the Materials and Methods 



3H9.7.1 



1.2.4 1.7 1.5 .7 2.3 3.2 3.9 .4 

ill I II I I II //- 



k 21 i^ii f' 1 i 1 ' 2 n 1,7 j_^Llj_^- 



-ft — 3H9.19.1 



Fig. 5. EcoRl restriction map of cloned soybean genomic DNAs 
containing a lipoxygenase gene, and sequence determination strate- 
gy. Thick lines represent cloned genomic DNA. Thin lines are 
lambda DNA arms. Numbers located between short vertical lines 
represent the approximate distance, in kb, between adjacent EcoRl 
sites. The expanded map below clone 3H9.19.1 shows sequencing 
strategy. Arrows indicate direction and extent of analysis. Sequenc- 
ing of upper or lower strand is shown by the positioning of the 
arrows above or below the solid line, respectively 



individual nick-translated EcoRl fragments to Northern 
blots of soybean embryo RNA. The RNA-reactive probes 
from each clone were the 1.2, 0.4, 1.7, 1.5 and 0.7 kb frag- 
ments (data not shown). We subcloned each of these frag- 
ments from 3H9.19.1 into the EcoRl site of pBR322 and 
performed nucleotide sequence analysis according to 
Maxam and Gilbert (1980). 

Structure of a lox-3 gene 

We sequenced a 5,708 bp DNA segment encompassing all 
of the 1.2 kb EcoRl fragment and extending through the 
0.7 kb EcoRl fragment (Figs. 5, 6). The structure of the 
lox gene was deduced following the additional cloning and 
sequencing of nearly full length cDNAs, and by performing 
primer extension and SI protection experiments to deter- 
mine the 5' boundary. The complete sequence and structure 



of the gene is shown in Fig. 6. The results of the primer 
extension and SI protection experiments shown in Fig. 7 
are in good agreement, and support the assignment of the 
mRNA start site to the region around nucleotide 816 (see 
Fig. 6), although there is an apparent heterogeneity of sev- 
eral nucleotides in the start of the lox gene. This assignment 
is also supported by the relative position of the "TATA" 
box spanning nucleotides 781 through 786 (TATA A A). The 
initiation of transcription for most eukaryotic genes occurs 
some 30 nucleotides downstream from the TATA box. In 
addition, although a typical CAAT box is not present, a 
CAAG sequence is located roughly 80 nucleotides upstream 
(nucleotides 732-735) from the message start site. The se- 
quence of 816 bp that extends upstream from the message 
start site was compared with upstream sequences of several 
other soybean genes that are expressed in embryos. Exclud- 
ing the TATA or CAAT regions, no obvious homologies 
that may- indicate functionally equivalent promoter or regu- 
latory regions were evident. 

The first ATG (methionine) codon after the start of 
transcription occurs at nucleotide 862 in the genomic se- 
quence, indicating that the lox coding sequence is preceded 
by a 5' untranslated region of about 46 nucleotides. The 
nearly full-length cDNA that was sequenced, p3H9FL, ex- 
tends 10 nucleotides into the 5' untranslated region and 
contains a 2571-nucleotide-long open reading frame. This 
is followed by a TGA stop codon and a 3' untranslated 
region of 148 nucleotides, which terminates at position 5101 
in the genomic sequence. The smaller cDNA clone, p3H9, 
spans nucleotides 4676 through 5149 in the genomic se- 
quence, demonstrating that at least two of the four consen- 
sus polyadenylation signals present in the 3' untranslated 
region at positions 4989, 5079, 5084 and 5119 are utilized. 
The 3' untranslated region of p3H9 is 196 nucleotides in 
length. It should be noted that both cDNA clones contain 
poly(A) tails. The open reading frame of p3H9FL encodes 
857 amino acids, with an estimated molecular weight of 
96,663 daltons, a size consistent with that observed after 
denaturing polyacrylamide gel electrophoresis of the in vi- 
tro translation product (Fig. 3). 

A comparison of the genomic sequence with the cDNA 
sequence indicates that the gene consists of nine exons and 
eight introns. The exons range in size from 86 to 950 bp, 
while the intron sizes vary from 93 to 448 bp. The intron 
sequences bordering the intron-exon junctions all conform 
to the GT-AG rule. The GC content of the exons and in- 
trons is 41.6% and 28.3%, respectively. 

Comparison of lox isozymes 

A comparison of the deduced amino acid sequences of var- 
ious lox isozymes derived from their corresponding cDNAs 
is shown in Fig. 8. The complete amino acid sequence for 
lox-1 is from Shibata et al. (1987) and the partial sequence 
for the isozyme designated 4i lox-1 -like " is from Start et al. 
(1986). The designation "lox-1 -like" was based on the fact 
that the clone encoding this sequence (pLX-65) showed re- 
duced hybridization to RNA from a lox-1 null mutant, 



Fig. 6. Nucleotide and predicted amino acid sequence of a soybean lipoxygenase-3 gene. Position 1 corresponds to the EcoRl site 
on the 5' side of the 1.2 kb EcoRl fragment of lambda clone 3H9.19.1. Major (*. •) and minor ( + , o) products from primer extension 
(*, + ) arid SI protection (•, o) experiments are indicated. Single underline, CAAT-like box; double underline, TATA box; overline, 
consensus polyadenylylation signals; arrows, locations of alternative poly(A) addition sites 



,\ATTCATTTCTAATCTAAAATTAAATATCTAAAAATATATCTAAAA1CAATTATGAATTCAGAATTAACT 7 u 

TTTTAACCTTAAACCAAACATGCACTCAAACTTATTTCAACTTATAATTCATTTrAAAATTTTCTATAAG 1 4 0 

ATAAAATCAAACATCTAACAATATATTTAAAATACGTCCAAAATCAATTTTTTAACATCAAACCAAACAC 2 1 0 

ACACTTATCCAAATACTTATTACTCACATTATCAATACAAACTTCCAATTTTCATCCAAAAAAAAATACC 280 

AAAACTACCTATCTATCAAACAAC'TTATAAGTTTTATTTATTTTCAATTAGTTTTATAACAAATTA^ 350 

CTCAATAAAAATATTTATTACATTAAAAACTAMCAAATTACTCTATTTTTACTCCTAAAAAAGCTCAAT 620 

AAAACTTA TAAAAATAATCTTCTATCTTCCTATAAATCCTCTTCTTCCTTTTAACAAAAGTACTATTAAA 6 y o 

TAAAJUUWAATATTAATCTCTTCACTATATCAATCATGTTAAAATATTTATTATAjAAAACACACTAACAA 5 60 

AATCTTCTGCCCAAGATTTAAAATTATCCAA'rTCACAAATGTACTCAATACTAACTACTAACTAGTAAGA 630 

AG(^WATCTTCAATATAATMTTATTCATTTTTATCTCCTTTCCTTCTTTCCATCCACGTATATGACAGT 700 

TTCGTTTCCACATTCCACTCTGACIXCGCTA CAACA CATTTATCTATGATCATCTTCGTTTCTTAGTCTT 770 

CCTCCTTM TTATAAA TACCACCAACGCCTGTCCTTCGCA^ 840 

HetLeuClyClyLeuLeuHisArgClyHisLysI leLysClyThrValV i 7 

TACTCTTGCTCCGTTCCAAAGATCCTTCCGGGTCTTCTCCATACCGGTCATAAG ATAAAACGCACACTGC 9 1 0 

ulLcuMetArgLysAsnValLtuAspValAsnSerValThrSerValClyGly LlelleGLyGlnClyLt; 40 

1'CTTWTCCGCAACAATGTCTTGCACGTCAATAGCGTAACCACCCTTGGGGCAATTATTGGTGAAGCTCT 980 

uAspLeuValClySerThrLeuAspThrLeuThrAlaPheLeuClyArgProValSerLeuClnLeuIlc 63 

CCACTTACTTGGCTCAACACTCGATAGTCTTAGTGCCTTCTTGGGCCGACCCGTGTCTCTCCAGCTTATT 1050 

SerAlaThrLysAlaAspA 70 

AGTCCTACCAAAGCTCATGCTTCCTTTTCTTCTTTCTTATTTTrCCTAATTACTTCATCCATCCATGCAT 1120 

GCCTTCAATTCTCACCTATCTTGCTCJVTCAACTCATCCCAACATCTATTACTGTTTTCGTTTAACCCTGT 1 190 

TTCATTTCAATCTATCCTCTCACATACTTCTCCGTTCAACCCACTTTCAATl'CGTCTAAAACATAAACCA 1 2 60 

TACACAGCXTAGATGTTTATAATCATCACGCGTCTCCTCCTTCTGTCTCTTCATTATTTGTTATATGACA J 3 30 

TTTGAATGTTATTAATTATTAGTATAGCATCTGCAGCATTLTCACTCTCTTACCAACUTACCTACGTTTC 1 400 

CACGCACCTCTATAAATGTTTAATTTTTTGTTTATGTGAAATGTGAAACTAACTCATGGTAGAACGCCTT 1470 



crClnVal TyrC 1 yAspHisThrS* 1 rCl nl K'ThrLysG iuHisLeuCluProA«nLeuCluClyLf»uTh 4 16 
GCCMCTCTATCCTGATCATACTAGTCAAATAACCAAAGAACACCTAGAGCCCAACTTAGAACCCCTCAC 3290 



rValAspGlu 

TGTACATCAGGTATATAATATGATAAACTTGTGTTTCCAAATAAMTTTTATTTGATCAATTCTGTTATT 



417 
3360 



TTATCTGTCTACTTTTTATTTTAGATCATGTTTCTATCAATMTMGACTAAATTCTATTTCCTTCGAAA 34 30 

AlallcClnAsnLyaArgLRuPhcLeuLcuClyHisHisAapProIlfMetProTyrLeuArgArgllo 440 

GGCAATTCAAAACAACACATTGTTCCTACTAGGTCATCATGACCCAATCATCCCATATTTCAGGCGAATA 3500 

AsnAlaThrSerThrLysAlaTyrAlaThrArgThrl lcLeuPhcLeuLvsAsnAspG lyThrLeuAreP 464 

AATCCAACCTCCACAAACCCTTATCCTACCAGAACCATCCTTTTCCTGAAAAATCACCGAACTTTAACAC 3570 

roLeuAlalleGluLeuSflrLcuProHisProGlnGlyAspClnScrClyAlaPheSerGlnValPheLp 687 

CACTTGCCATAGACTTGACTTTCCCACATCCTCAGGCAGATCAATCTCCTCCTTTTACTGAACTTTTTCT 3660 

uProAlaAspGluClyValCluSerScrlleTrpLeuLeuAULysAlaTyrValValValAsnAspSer 510 

CCCTGCACATGAAGCTGTTGAAAGTTCTATTTCCCTCCTACCAAAGGCTTATGTAGTTGTGAATCACTCC 3710 

CysTyrHisClnLeuValSerHisTr 3 ] 9 

TGCTATCATCAACTTGTCAGCCATTGCTATACATTTTCTCAACGAAATAATATAAAAATAAATAAACTTA 3 780 

pLeuAsnThrHisAlaValVa 526 

TATATATATATCTAGAAAATGATrATTAATATCTTTTGATCCATAATACGTTAAACACTCATGCACTTGT 3850 

lGluProPhel lei leAlaThrAsnArpllisLeuSerVal ValHisProI leTyrLysLeuLruHisPro 549 

TCAGCCATTCATCATACCMCAAACAGGCATCTCAGTGTTGTTCACCCTATTTATAAACTCCTTCACCCT 3920 

HisTyrArgAspThrMetAsnl lcAsnGlyLeuAlaArgLeuSerLeuVa lAsnAspClyClyVa 1 1 UG 573 

CACTATCGTGACACCATGAACATAAATGCCCTTGCTCGGTTATCACTGGTCAACCACGGTGGCCTTATAC 3990 

luGlnThrPheLeuTrpGlyArgTyrSerValGluMctSerAlaValValTyrLysAspTrpValPhoTh 596 

AACAAACATTTTTCTCGGGAAGGTATTCTCTGGAAATGTCTGCTGTAGTTTACAAGGATTGGCTTTTTAC 406O 

rAs pG 1 nA laLeuP roA laAspLeul leLys Ar 

AGATCMCCATTGCCTGCTGACCTTATAAAAAGGTAAATCATATATCTTACAATACGAAACATGAAACTC 

AATTTAATTTATAGTTAACAAAAAATTGAAGACACTTTTGATGTTATATAATTAATGTCATGTGATATAG 4 200 



607 
4130 



laAsnClyLysGlyLysLeuGly 7 7 

CATATTCCTTTTTACTTTTGTrTTCACTCTTTCTTTTCTTT 1540 

LysAlaThrPheLeuCluClylielleThrSerLeuProThrLeuGlyAlaGlyGlnSerAlaPheLysI 101 

^ AAGCCTACCTTT'rTGGAAGGTATCATTACTTCATTCCCAACTTTGCGACCAGGCCAATCTCCATTCAA 1610 

leAsnPheCluTrpAspAspGlySerGlylleLcuGlyAlaPheTyrlleLyaAiinPheMetClnThrGl 124 

■rTAATTTTGAATGGGATGATCGCACl'GGAATTCTTGGACCATTTTATATCAAGAATT'rTATCCAAA CTGA 1680 

uPhePheLttuValSerLeuThrLfcuCiuAspIleProAsnHisClySerlleHisPheValCysAsnScr 16 7 

G'fT'I^TCCTTGTGAGTlTCACTCTTGAAGACATl'CCAAACCATGCAAGCATCCACTTTGTTTGCAATTCC 1750 

TrpIleTyrAsnAlaLysLeuPheLysSerAspArgllePhePheAlaAsnGIn 165 

TGGATTTaCAATGGCAAACTCTTCAAAAGTGACCGCATTTTCTTI'CCCAACCAGGTCAACTATAATTAAT 1820 



gGlyMetAlalleGluAspProScrCysProHisClyneArgLcuVaUlpGluAspTyrProTyrAla 630 

ACCAATCGCAATTGAGGATCCATCGTGCCCTCATCGGATTCCCCTTGTGATAGAGGACTACCCTTATCCT 4270 

ValAspGlyLpuGlulleTrpAspAlalleLysThrTrpValHisGluTyrValPhcLeuTyrTyrLysS 656 

GTTCATGCACTTCAGATATGGCATGCTATCAAGACATGGGTCCATCAATACCTTTTCTTCTACTACAAAT 4 34 0 

nrAspAspThrl.euArRGluAspProGliiLeuClnAlaCysTrpLysCluLeuValGluValGlyMisGl 677 

CAGATGACACACTTAGAGAAGATCCTGAACTCCAAGCCTGCTCGAAAGAACTCCTAGAGGTGCGTCATGC 44 1 0 

yAspLysLysAsnGluProTrpTrpProLysMetGlnThrArgGluCluLeuValGluAlaCysAlalte 700 

AGACAAGAAAAATGAGCCATGGTCGCCTAAGATGCAAACTCGTGAAGAGCTAGTTGAACCTTGCGCTATC 4480 

IlelleTrpThrAlaSerAlaLeuHisAlaAlaValAsnPheClyGlnTyrProTyrClyClyLeuIleL 724 

ATCATATGGACTGCTTCAGCACTTCATGCAGCTCTTMTTTTCGACACTATCCCTATGGACGTTTAATCT 4550 



GAAACTTAAATATAACTTTTAMTCTTCA^TAACTTTGTTTCTTTTCTCTAGTTATTTCATTTTTCCAAA 1890 
AAAMGTTTTTCTCMTACCCGAGTGTCAATTMTGTGGTTTATTTTCAACTTGTTGAACTTTAAGGCAT 1960 



ThrTyrLeuProSerGluThrProAlaProLeuVa 
(^WTGAAAAAATAAAGTTAACAAAACATATTCCAGACATATCTTCCAAGTCAGACACCAGCTCCACTAGT 



177 
2030 



1 LysTyrArgGluGluGluLeuHi sAsnLeuArgGlyAspGlyThrClyGluArgLvsGluTrpGluArg 200 

caaatatagagaagaagagttgmtaatttaagaggagatggaactcgagaacgcaaagagtgggaaagg 2 1 00 

^ValTyrAspTyrAspValTyrAsnAspLeuGlyAspProAspLysGlyGluAsnHisAlaArgProValL 224 

¥ CTCTATGATTATGATGTCTACAATGATTTAGGTCATCCGGATAAAGGTGAAAATCATGCCCGTCCTCTTC 2170 

euGlyGlyAsnAspThrPhcProTyrProArgArgGlyArgThrGlyArgLysProThrArgLysA 246 

TTGGAGGAAATCACACCTrTCCTTATCCTCGTAGGGCGAGAACTGGTAGAAAACCAACAAGGAAAGGTTA 2240 

CGCTTCTATrrCTATTTTTAAATTACTCTATCACAGACTCAAATCTGTTCTTCAATTCTATAATATTTAT 2 310 

GAGCTCTTAGCATAAATGTACTTTTAATATGTCAAGAGTCATATAACTTTGAATTCGTATATTGTTTTAG 2380 

CTAGAAGAGTTCTACATGTACATCTTAACATAAAGGTrGACTGTTTCTAATCAAGTAATCCTAAATTATC 2450 

Jaaggagtacttacaaacaaccatttcattcaactttmctcttt^ 2520 

spProAsnSerGluScrArgSrrAsnAspVa 256 

TTTTTGTTGCTAATAAGTTCTGCTATTTTCATTAAACAGATCCTAATAGTGAGAGTAGGAGCAATCATGT 2 590 

lTyrLeuProArgAspGluAlaPheGlyHisLeuLysSerSerAspPheLcuThrTyrGlyLcuLysSer 279 

TTATCTTCCAAGAGATGAGGCTTTTGGACACTTCAAGTCATCTGACTTTCTTACTTATGGACTAAAATCC 2660 

ValSerGlnAsnValLeuProLeuLeuGlnSerAlaPheAspL(*uAsnPheTTirProArgGluPheAspA 303 

GTATCTCAAAATGTTCTTCCATTATTGCAATCTCCTTTTGATTTGMTTTCACACCCCGTGAGTTTGATA 2 730 

^rPheAspGluValHisGlyLeuTyrSerGlyGlylleLysLeuProThrAspHelleSerLyslleSe 326 

GCTTTGATGAAGTTCATCGACTCTATTCAGCCGGAATTAACCTCCCAACAGATATAATCAGCAACATTAC 2800 

rProLpuProValLcuLysGlullePheArgThrAspClyCluGlnAlaLcuLysPheProProProLys 349 

TCCACTAGCCGTGCTTAAGGAAATCTTCCGAACTGATGGTGAACACGCCCTTAAGTTTCCTCCTCCTAAA 2870 

VallleOlnV ^ 

GTAATTCA>"GTATCTCATAATATTAAGTACCTGTATCTAAAATGGCGTTACAATTAACAGCATTGGAGT 2940 



euAsnArgProThrLeuSerArgArgPheMntProGluLysClySerAlaGtuTyrGluCluLenArgLy 747 

TAAACCCTCCAACTCTTAGTACGCGATTCATGCCTGAGAAAGGTTCTGCTGAGTATGAGGAGCTGAGGAA 6620 

sAsnProGlnLysAUTyrLpuLysThrlleThrproLysPheGlnThrLeiilleAapLeuSerValllc 770 

GMTCCCCAGAAGGCTTACTTGAAGACTATTACACCAAAGTTTCAGACCCTTATTGACCTTTCTCTTATA 4690 

ClulleLeuSerArgHisAiaSerAspGluValTyrLeuGlyGluArgAspAsnProAsnTrpThrSerA 794 

GAAATCTTGTCAACGGATGCATCTCATGAGCTGTACCTTGGGGAGACGGACAATCCAAATTGGACATCTG 4760 

spThrArgAULeuGluAlaPtieLysArgPheGlyAsnl.ysLeuAlaGlnneGluAsnLysL«uSerGl 817 

ATACMGAGCGTTACAGGCTTTTAAAAGCTTTGCAAATAAACTGGCACAAATTGAGAATAAACTCTCAGA 4830 

uArgAsnAsnAspGluLysLeuArgAsnArgCysGlyProValGlnMetProTyrThrLeuLeuLeuPro 840 

GAGAAACAACGATGAGAAACTGACAAACCGTTGTGGACCACTTCAAATGCCTTATACTCTCCTTrTGCCT 4900 

SerSerLysGluGlyLeuThrPheArgClyl leProAsnSerlleSer IleEnd 857 

TCTAGTAAGGAAGGATTAACTTTCAGAGGAATTCCCAACAGTATCTCTATCTGAAGAGCTTTGTGGTTCC 49 70 

TTAAGTTTTGTGCCTTTGAATAAAAGATAGAAGAGMCGAATCTGAATTTCTTCATCCAACCTGCACATG 5040 

GTAATCTAGTG TGTTTGAGTC C TTTCA TG TTTGG CTTTAATAAAATAAAAGCA GTTCTT rT^TAAAA TG T 5 1 10 

tcagmtcaataaaagcatttactgctttgctttgtgcc'c^aacgttacccatctaacttgaatagaaat 5 1 80 

AAATAATGAGTTGGACAAAACCTACAACCGCTTGATAACCCATTGTTCAAATATAATCCGTAATCTTTGT 5250 

TTTAT 1 1 1 1 1 GTTACAAGAGATATAAGAAACAAAATATATAATGCAAACTTTATTATTAAAATTATATTC 5320 

GCTAAGACCTTCGTGTCTCACCACGAATTTGATTTTCACAATCAAAACGCTCAACCTATATTAATAACAT 5 390 

CACACGACGCAACTAACTACCCAAATATCCTTATTATAACATATAAATGAAAGTTTACAAATTGAAGAGA 54 60 

ACCTTGTGCACGAGGATAACTTTTTTATCATTCCATGCTTTTCCCGTTCATTCTGCATCATATATCTTGC 5530 

AAATTCAAACTCTGTTTCATCGTTCTCGGATTCACTCTATTAACAATCTACATGACATAAAATTACTAAA 5600 

TATAAAAAAGAATAATCCTTGAAGGCCTTAAACTCGAACAAACCTACTCAATTAGAAGATTTTCAACTTT 56 70 

GCGACAATCCTTGAATTTGCAATACATCTTGAATACTCG 5 709 



alScrLvs 355 

CCTAATAATGGATATCTGAATTAATCCTA TTTCA CTAACGATATTTGTTATTATTTGAGCAGTGAGTAAG 30 1 0 

"erAlaTrpMetThrAspGluCluPheAlaArgGluMctLeuAlaGlyValAsnProAsnLeuIleArgC 379 

TCTGCATGGATGACTGAfGMGAATTTGCAAGACAAATCCTTGCTGGTGTAAATCCAAACTTGATTCCTT 3080 

ysLeuLys 381 

GTCTTAACCTTGACTTTTCTTTGCATGTCAACTTTCTGATTGATATCCTTGATTGGCAGGATGACCCAAT 3 1 50 

GluPheProProArgSerLysLeuAspS 391 

CGAGGAGGATCATA TATATTTCCTTATAATTTA TTATTGCACGAGTTCCCTCCACCAAGCAACCTAGATA 3220 
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Fig. 7. Identification of the 5' end of the lipoxygenase gene by 
primer extension and SI protection studies. Labeled DNA frag- 
ments used as hybridization probes were annealed in the presence 
of yeast tRNA (lanes 1A and 1B) or total polysomal RNA (lanes 
2A and 2B), and either extended with reverse transcriptase (A) 
or treated with nuclease SI and the products were analyzed by 
acrylamide gel electrophoresis (B). The autoradiogram in (A) shows 
the chemical degradation products used as size markers, with base- 
specific cleavages indicated above the appropriate lane. Numbers 
indicate the size of fragments in nucleotides 

while no reduction was seen in hybridizations with RNAs 
from other lox nulls. However, B. Axelrod has recently 
isolated a cDNA clone that has the same sequence as pLX- 
65, and identified it as encoding a lox-2 isozyme; this was 
based on a comparison of the deduced amino acid sequence 
with extensive amino acid sequence data obtained from pur- 
ified lox-2 protein (personal communication). The differ- 
ence between the two assignments of lox isozymal type may 
possibly be a consequence of the high degree of homology 
between the lox-1 and lox-2 sequences. The deduced partial 
amino acid sequence of the " lox-1 like" (lox-2) protein 
shows 83% homology with the corresponding sequence of 
lox-1. At the nucleotide level, the coding regions are 89% 
homologous. Cross hybridization of the pLX-65 probe 
(which contained some coding region) with lox-1 mRNA 
in the lox-2 null mutant could have masked the absence 
of the mRNA for lox-2, particularly so if lox-1 mRNA 
levels are substantially higher than those of lox-2 in soybean 
embryos. We are currently planning to address the issue 
of relative lox mRNA levels in soybean embryos through 
the use of gene-specific oligonucleotide primers. 

Additional confirmation that the gene in 3H9.19.1 en- 
codes a lox-3 isozyme comes from a comparison of the 
deduced lox amino acid sequences with the determined ami- 
no acid sequence data obtained by microsequencing of puri- 
fied lox-3 peptides (data provided by B. Axelrod). As can 
be seen in Fig. 8, the deduced lox-3 amino acid sequence 
shows 100% homology with the actual amino acid se- 
quences of four different regions of lox-3. On the other 
hand, regions of dissimilarity with the lox-1 and " lox-1 - 
like" (lox-2) sequences are evident. This finding is consis- 



tent with data presented earlier, showing that the gene spe- 
cific clone p3H9/3' does not hybridize to RNA from a lox-3 
null mutant (Fig. 4). Thus, we are certain we have a gene 
encoding a lox-3 isozyme, although we are unable to deter- 
mine whether we have a lox-3A or lox-3 B isozyme from 
the available data. 

As shown in Fig. 8, the deduced protein sequences of 
lox-3 and lox-1 show 70% homology, while the lox-3 and 
"lox-1 -like" (lox-2) deduced sequences have 73% homolo- 
gy. The corresponding nucleotide sequence homologies over 
the same regions are 74% and 73%, respectively. The se- 
quence of lox-1 indicates that it consists of 838 amino acids 
while the lox-3 sequence encodes 857 amino acids. The 
length difference of 19 amino acids is due, principally, to 
the presence of codons for 17 additional amino acids within 
the first exon of lox-3. The amino acid sequence within 
exon 6 of lox-3 (amino acids 382-417) also shows a high 
degree of divergence from that of lox-1 . Exons 1 and 6 
of lox-3 show roughly 44% and 42% nonhomology, respec- 
tively, with their corresponding regions in the lox-1 se- 
quence, while the remaining exons differ only by 16%— 25%. 

We tried to identify putative functional domains on the 
basis of exon-intron structure, hoping that this might pro- 
vide some insight into the enzymatic characteristics of these 
isozymes, but with the exception of the differences noted 
above, we had no success. For instance, hydropathy analy- 
sis performed on lox-3 revealed no obvious hydrophobic 
lipid binding site(s) in the above locations, although the 
region spanning amino acids from approximately 500 
through 700 has a slightly more hydrophobic nature than 
the remainder of the protein (data not shown). This latter 
region corresponds to exon 8 and a small portion of exon 
9 of lox-3. 

As mentioned earlier, lox-2 and -3 are generally classi- 
fied together as type 2 enzymes, while lox-1 is placed in 
a separate class, designated type 1. The biochemical and 
enzymatic properties of lox-1 differ markedly from those 
of the other two lox isozymes. It is interesting, then, that 
lox-1 and lox-2 have a substantially higher degree of homol- 
ogy than do lox-2 and lox-3. Moreover, a comparison of 
the 3' untranslated regions of cDNA clones for these three 
isozymes reveals that lox-3 and -2 are more closely related 
to each other than either one is to lox-3 (Fig. 9). Although 
the nucleotide sequences of the coding regions of the three 
lox isozymes show a minimum of 70% homology, the 3' 
untranslated regions of lox-1 and lox-2 show only 47% 
and 49% homology, respectively, with lox-3. On the other 
hand, a comparison of this region between lox-1 and -2 
shows 76% homology. Therefore, the 3'-end of lox-3 gene 
is substantially more divergent from the corresponding re- 
gion of the cDNA sequences for lox-1 and -2. In addition, 
the lox-1 and -2 genes not only have a higher degree of 
structural similarity, but they are also genetically linked. 
Soybean lox-1 and -3 segregate independently, but the genes 
for lox-1 and -2 do not (Hildebrand and Hymowitz 1981; 
quoted in Casey et al. 1985). Thus, although the designation 
"lox-1 -like" is a misnomer in a formal sense, it is a cur- 
iously appropriate designation when viewed in light of the 
above considerations. 

We hope a complete characterization of the entire gene 
family will make it possible to address the means by which 
specific lipoxygenase genes are regulated, as well as to de- 
fine how the different isozymes influence the overall physi- 
ology of the plant. 
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Fig 8 Comparison of the deduced amino acid sequences for lipoxygenase isozymes. Sequences are aligned for maximum homology. 
Shaded areas indicate differences. Arrows indicate borders between adjacent exons. Residues determined by microsequencing of lox-3 
peptides are indicated by asterisks. Residues not determined by microsequencing are shown by dashes (-). Determined and deduced 
lox-3 sequences were 100% homologous. The lox-1 sequence is from Shibata et al. (1987). "Lox-l-like" is the designation given by 
Mart etal. (1986) to the clone pLX-65 and includes corrections of their earlier published sequence data (D. Hildebrand, personal 
communication). As discussed in the text, this latter sequence is most probably that of the lox-2 isozyme 
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TACCTTCCAATAAAAAAGATGCAAGACTAGAGATCCgATAAATCTTGCATCTTATCTAA 
TACCTTCCAATAAAAAAGATGCAAGGCTAGAGATCCAATAAATCTTGCAQcgTATCTAA 
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TCTTTCMQTATCQrCTCrn 



frTTAA TjAAAAfr AAj 
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TAAjAlAGC 
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Fig. 9. Comparison of 3; untranslated regions of mRNAs encoding different lipoxygenase isozymes. The first nucleotide shown follows 
the stop codon of each isozyme. The "lox-l-like" sequence has been corrected as described in Fig. 8. Shaded areas reflect differences 
between the sequences 
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A Baculovirus Blocks Insect Molting by Producing 
Ecdysteroid UDP-Glucosyl Transferase 

David R. O'Reilly a nd Lois K. Miller 

The predicted amino acid sequence of a newly identified gene of the insect baculovirus 
Autographa californica nuclear polyhedrosis virus was similar to several undine 5'- 
diphosphate (UDP)-glucuronosyl transferases and at least one UDP-glucosyl transfer- 
ase. Generic and biochemical studies confirmed that this gene encodes an ecdysteroid 
UDP-glucosyl transferase (egt). This enzyme catalyzes the transfer of glucose from 
UDP-glucose to ecdysteroids, which are insect molting hormones. Expression of the 
egt gene allowed the virus to interfere with normal insect development so that molting 
was blocked in infected larvae of fall armyworm (Spodoptera frugiperda). 

hypermutable region in serially propagated 
viruses (2). The sequence of this region 
revealed an open reading frame that could 
encode a 57-kD polypeptide of 506 amino 
acids (Fig. 1). The egt product shares 21 to 
22% amino acid sequence identity with 
several mammalian UDP-glucuronosyl 
transferases, also shown in Fig. 1. In mam- 
mals, the UDP-glucuronosyl transferases 
catalyze the transfer of glucuronic acid to a 
wide variety of exogenous and endogenous 
lipophilic substrates (J). This conjugation 
reaction is of critical importance in the 
detoxification and safe elimination of a mul- 
titude of drugs and carcinogens. In addition, 
the normal metabolism and disposal of vari- 
ous endogenous compounds, such as biliru- 
bin and steroid hormones, proceed through 
their conjugation with glucuronic acid. 
Available evidence on insect systems indi- 
cates that sugar conjugation reactions of this 
type involve glucose rather than glucuronic 
acid transfer (4). No sequences of UDP- 
glucosyl transferase genes were available in 
GenBank at the time of our search, but the 



BACULOVTRUSES CONSTITUTE A LARGE 
group of DNA-containing viruses 
that infect only invertebrate hosts. 
These viruses, many of which infect pest 
iepidoptcran species, are of particular inter- 
est because of their potential as biological 
control agents (?). Studies of Autographa 
californica nuclear polyhedrosis virus 
(AcMNPV), the model system for baculo- 
virus research, are revealing the molecular 
mechanisms by which the virus implements 
its replication strategy. Most of the genes 
identified to date are involved in the interac- 
tion of the virus with its host cell, but little is 
known of the molecular aspects of infection 
at the organismal level. We now report that 
AcMNPV has a gene that allows the virus to 
manipulate the hormonal regulation of de- 
velopment of its larval host. 

The region of the AcMNPV genome con- 
taining the egt gene, spanning 8.4 to 9.6 
map units, first came to our attention as a 



Table 1. Substrate specificity of egt gene product. 
Substrates were incubated in the presence of 
medium derived from appropriately infected cells 
and 0.05 uCi UDP-[U- f4 C)glucose (312.5 mCi/ 
mmol) for 1 hour at 37°C All substrates were 
used at a concentration of 1 mM. Oher condi- 
tions were as described in the legend to Fig. 2. 
Amounts of glucose transferred were calculated 
after scintillation counting of the appropriate 
regions of the chromatography plates. A hyphen 
indicates that less than 1% of the glucose was 
transferred. 
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sequence of a UDP-glucosyl transferase 
gene from Zea mays (maize) was subse- 
quently reported (5). The COOH-terminal 
portion of this protein also displays homolo- 
gy to egt and to mammalian UDP-glucur- 
onosyl transferases (Fig. 1). 

Mammalian UDP-glucuronosyl transfer- 
ases are known to be membrane-bound, and 
the amino acid sequences of these proteins 
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contain both a putative signal sequence at 
the NH2-terminus and a halt-transfer se- 
quence at the COOH-terminus (6-8). Al- 
though the predicted amino acid sequence 
of egt contains a putative signal sequence, 
the protein is approximately 30 amino acids 
shorter than the mammalian enzymes at the 
COOH-terminus (Fig. 1) and lacks the resi- 
dues that constitute the halt-transfer se- 
quence. These facts suggested the possibility 
that the egt protein is secreted. 

The predicted amino acid sequence of the 
AcMNPV egt protein suggested to us that 
thc enzyme might conjugate and eliminate 
some compound in the insect hemolymph. 
As in mammals, a wide variety of both 
exogenous and endogenous substrates are 
prone to conjugation in insect systems (4). 
Recendy, an ecdysteroid-glucose conjugate 
was reported (9). Therefore, several ecdys- 
teroids were included among the com- 
pounds tested as potential substrates for egt- 
mediated conjugation (Fig. 2). 

Spodoptera Jrugiperda (fall armyworm) cells, 
which serve as hosts for AcMNPV, were 
infected with wild-type (wt) AcMNPV L-l 



(10) or the recombinant virus vEGTZ. This 
mutant virus differs from wt AcMNPV only 
in the disruption of the egt gene by the 
insertion of the Escherichia coli p-galacto- 
sidase gene (Fig. 1). Twelve hours after 
infection, lysates or media were incubated in 
the presence of [ 3 H]ecdysone and either 
UDP-glucose (Fig. 2) or UDP-glucuronic 
acid. Samples were then analyzed for ecdy- 
sone derivatives with altered polarities by 
thin-layer chromatography. A novel ecdysone 
derivative (EG) was observed only when the 
lysate or extracellular medium of wt 
AcMNPV-infected cells was used (Fig. 2A). 
When UDP-[U- l4 C]glucose was included in 
the reaction mix, it was possible to detect both 
3 H and I4 C in the product (Fig. 2B), corifirm- 
ing that the conjugation reaction involves the 
transfer of glucose from UDP-glucose to ec- 
dysone. The ecdysteroid UDP-glucosyl trans- 
ferase activity was found predominantly in the 
extracellular medium. No ecdysteroid-conju- 
gating activity was produced by mock-infect- 
ed or vEGTZ-infeaed cells. 

Although several ecdysteroids could func- 
tion as substrates for this enzyme, none of 



Rg. 1. Similarity of egt 
gene product to several 
UDP-glucuronosyl trans- 
ferases and a UDP-glu- 
cosyl transferase. The 
predicted amino acid se- 
quence of egt (E) (13) 
was compared to human 
(H) (<5), mouse (M) (7), 
and rat (R) (8) UDP- 
glucuronosyl transfer- 
ases bv the use of the 
FASTP algorithm (14) 
as implemented by Inter- 
national Biotechnolo- 
gies, Inc. The homology 
of a plant UDP-glucosyl 
transferase (Z) (5) to die 
above proteins is also 
displayed. Uppercase let- 
ters denote exact match- 
es; lowercase letters cor- 
respond to substitutions 
that occur frequendy 
among related proteins 
{15). Substitutions that 
occur in frequendy are 
indicated by a dot. Gaps 
in a sequence are desig- 
nated by hyphens; a car- 
et marks where an amino 
acid has been deleted 
from the sequence. The 
amino acid sequence be- 
tween the vertical arrows 
was replaced by the 0- 
galactosidase gene in 
vEGTZ. 
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the other compounds tested were conjugat- 
ed to a significant extent (Table 1). The 
ecdysteroids ecdysone, 20-hydroxyecdy- 
sone, and makisterone A were conjugated 
only in the presence of medium from wt- 
infected cells when UDP-glucose was in- 
cluded in the reaction mix. They were not 
conjugated when UDP-glucuronic acid was 
used (11) or when they were incubated in 
the presence of medium from vEGTZ (Ta- 
ble 1) or mock- infected cells (11). Thus, we 
conclude that the product of the egt gene is 
an ecdysteroid UDP-glucosyl transferase 
that is secreted into the extracellular medium 
by wt virus-infected cells. 

Hemolymph titers of ecdysteroids fluctu- 
ate in a cyclic fashion to regulate both larval- 
larval and larval-pupal molts (12). Since 
glucose conjugation is suspected to act as an 
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Fig. 2. AcMNPV-specific ecdysone UDP-gluco- 
syl transferase activity. (A) Spodoptera Jrugiperda 
cells were infected with wt AcMNPV (wt) or 
vEGTZ at a multiplicity of infection of 20. 
Twelve hours later, the cells (C) and overlying 
mcdium (M) were harvested separately; the cells 
were lyscd in tissue culture fluid by several strokes 
of a Dounce homogenizer. Mock- infected cell 
cultures were treated in parallel. Thc enzymatic 
assay was modified from fiansal and Gessner {16). 
The standard incubation mixture included the 
following: cell lysatc containing 10 ng of total 
protein or mcdium from thc equivalent number 
of cells; 10 mAf MgCl 2 ; 10 mAf tris maleate, pH 
7.4; 1 mAf unlabeled UDP-glucose (Sigma 
Chemical); 100 \xM unlabeled ecdysone (Sigma 
Chemical); and 0.25 jiCi [ 3 H]ccdysonc (DuPont 
Biotechnology Systems). Thc final reaction vol- 
ume was 50 Reactions were carried out for 5 
min at 37°C, then stopped by thc addition of two 
volumes of cthanol. Products were evaporated 
and resuspended in 60% cthanol. (B) Cell lysarcs 
from wt- infected cells were assayed as described 
above except that thc concentration of UDP- 
glucose was reduced to 3.16 jjuVf and thc reaction 
was allowed to proceed for 30 min. The UDP- 
glucose used was cither unlabeled (lane 1) or 
UDP-[U- M C]glucosc.(0.05 uCi; DuPont Bio- 
technology Systems) (lane 2). [ 3 H] Ecdysone was 
included in both reactions. Products were separat- 
ed by thin-layer chromatography on silica-gel 
plates (Merck) as described (16). [ 3 H]Ecdvsone 
([ 3 H]E) and UDP-[U- ,4 C]giucose (["C]G) 
were also subjected to chromatography. Autora- 
diographs of the silica-gel plates are presented. 
Scintillation counting confirmed the presence of 
both 3 H and l4 C in the ecdysonc-glucose conju- 
gate in (B), lane 2. The positions of unconjugated 
(E) and conjugated (EG) ecdvsone, as well is 
UDP-glucose (UDPG), are indicated. 
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ccdystcroid inactivation mechanism (9), wc 
tested whether AcMNPV infection, through 
the expression of egr, would disrupt the 
normal developmental process of the host 
insect. Sixteen newly ecdysed fourth instar 
S. fiugiperda larvae were infected by injection 
either with wt AcMNPV or vEGTZ and 
monitored daily for any perturbations in 
their development. One cohort of 16 larvae 
was injected with tissue culture fluid as a 
negative control. All larvae injected with 
tissue culture fluid molted to fifth instar as 
expected. Only 1 of 16 larvae infected with 
wt virus made this transition. In contrast, all 
larvae infected with the mutant vEGTZ 
underwent a fourth-to-fifth instar molt. 
Thus, it is clear that egt expression by wt 
AcMNPV specifically inhibits host molting. 

Both infected groups of larvae subse- 
quently succumbed to the viral infection, 
showing that disruption of egt did not pre- 
vent vEGTZ from completing its infectious 
cycle. Using newly ecdysed fifth instar lar- 
vae, wc have observed that no wt- infected 
larvae showed any signs of pupation, where- 
as the majority of vEGTZ-infected insects 
displayed several behavioral modifications 
(feeding cessation, wandering, and spin- 
ning) characteristic of an impending larval- 
pupal molt. However, all virus- infected ani- 
mals died before pupation. 

The data show that the baculovirus 
AcMNPV specifically interrupts the normal 
development of its insect host by producing 
an ecdysteroid UDP-glucosyl transferase, 
which is possibly secreted into the hemo- 
lymph by infected cells. Our observations 
provide the first genetic and developmental 
evidence in support of the hypothesis (9) 
that glucose conjugation by UDP-glucosyl 
transferases is a mechanism for ecdysteroid 
inactivation. AcMNPV egt is the first identi- 
fied gene encoding a glucosyl transferase 
capable of conjugating ecdysteroids. The 
identification of an insect virus gene encod- 
ing such an enzyme should gready facilitate 
the study of equivalent insect genes. 

It is probable that other baculoviruses and 
other insect viruses also interfere with the 
development of their hosts by altering hor- 
monal regulation. A more detailed analysis 
of the regulation of AcMNPV egt expression 
in the insect, as well as a closer study of the 
hormonal changes brought about by viral 
infection, will lead to a deeper understand- 
ing of the significance of this activity to the 
viral life cycle. In particular, such studies 
should reveal the evolutionary advantage 
gained by viruses expressing an ecdysteroid 
UDP-glucosyl transferase. 
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T Cells Against a Bacterial Heat Shock Protein 
Recognize Stressed Macrophages 
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Heat shock proteins are cvolutionarily highly conserved polypeptides that are pro- 
duced under a variety of stress conditions to preserve cellular functions. A major 
antigen of tubercle bacilli of 65 kilodaltons is a heat shock protein that has significant 
sequence similarity and cross-reactivity with antigens of various other microbes. 
Monoclonal antibodies against this common bacterial heat shock protein were used to 
identify a molecule of similar size in murine macrophages. Macrophages subjected to 
various stress stimuli including interferon-? activation and viral infection were 
recognized by class I— restricted CD8 T cells raised against the bacterial heat shock 
protein. These data suggest that heat shock proteins are processed in stressed host cells 
and that epitopes shared by heat shock proteins of bacterial and host origin arc 
presented in the context of class I molecules. 



EXPOSURE OF CELLS TO VARIOUS 
stress conditions leads to the synthe- 
sis of a family of polypeptides termed 
heat shock proteins (hsp's) (1). After immu- 
nization with tubercle or leprosy bacilli, a 
major fraction of T cells and antibodies is 
directed against the 65 -kD hsp, a newly 
discovered member of the hsp family (2-4). 
This hsp is a homolog of the GroEL hsp of 
60 kD in Escherichia coli and shares some 
degree of homology with the E. coli DnaK 
hsp of 70 kD, which is a homolog of the 
mycobacterial 71-kD hsp (2). Pathogenic 
mycobacteria preferentially live and replicate 
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inside macrophages, and it can be assumed 
that the stress imposed by activated macro- 
phages induces abundant hsp synthesis in 
their intracellular parasites, which will then 
result in a strong immune response to this 
antigen. T cells with specificity for the 65- 
kD hsp are not only demonstrable in pa- 
tients but also in many healthy individuals, 
and a great variety of microbes are known to 
have similar hsp's (2, 3). Indeed, hsp's have 
been highly conserved in evolution, and in 
some cases similar molecules are shared by 
prokaryotic and eukaryoric cells (/). Our 
study reveals that a homolog of the myco- 
bacterial 65-kD hsp is present in murine 
macrophages. This molecule is recognized 
by T cells raised against the mycobacterial 
65-kD hsp and could serve as a target for 
protective and autoreactive immune re- 
sponses. 

Lysates of bone marrow-derived macro- 
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INHERITANCE OF CYCLIC HYDROXAMATES IN Zea mays L. 
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Dunn, G. M., Long, B*. J. and Routley, D. G. 1981. Inheritance of cyclic 
hydroxamates in Zea mays L. Can. J. Plant Sci; 61: 583-593. 

Hydroxamic acids have been implicated in the resistance of corn (Zea mays L.) to both 
fungi and insects. In this study, five selected crosses were used among the four inbreds 
BxBx, bxbx, B49 and B37 to study inheritance of hydroxamates. Hydroxamate concen- 
tration in the parental, Fj , F 2 and backcross generations for each cross was estimated by 
a rapid procedure based upon the colori metric reaction of hydroxamates with FeCl 3 . 
Components of variance and estimates of heritability were obtained by the procedures of 
Warner (1952). F 2 and backcross data indicated that concentration of hydroxamates is 
controlled monogenically in the cross bxbx x BxBx and polygenically in the crosses 
bxbx x B49 and bxbx x B37. Estimates of gene number using the Castle-Wright 
formula indicated that hydroxamate concentration is conditioned in B49 and B37 by five 
and two loci, respectively. The addition of BxBx to either B49 or B37 increased the 
frequency of genotypes in F 2 with a high concentration of hydroxamates. Additive 
genetic variance was the most important component of the phenotypic variance and 
resulted in estimates of heritability from 0.64 to 0.79. However, the dominance 
component of variance was considerably higher for crosses involving BxBx than for the 
crosses bxbx x B49 and bxbx x B37. 

Les acides hydroxamiques semblent jouer un role dans la resistance du rnais {Zea mays 
L.) aux champignons cryptogames et aux insectes. Dans Tetude decrite ci-apres, nous 
avons examine le mode de transmission hereditaire de ces substances sur cinq croise- 
ments realises entre quatre lienees consanguines: BxBx, bxbx, B49 et B37. La concen- 
tration en hydroxamates chez les lignees parentales. les F, et F 2 et les retrocroisements 
impliques dans chaque croisement a ete estimee par une methode rapide utilisant la 
reaction colorimetrique des hydroxamates au FeCl v L'analyse de la variance et les 
estimees de Theritabilite ont ete calculees par les methodes de Warner (1952). Les 
donnees obtenues sur les F 2 et !es retrocroisements montrent que la concentration en 
hydroxamates est modulec par un seul gene dans le croisement bxbx x BxBx et par des 
genes multiples dans les croisements bxbx x B49 et bxbx x B37. Les estimees du 
nombre de genes, etablies par la formule de Castle-Wright, revelent que ce caractere est 
gouverne par cinq loci dans B49 et par deux dans B37. L'adjonction de BxBx a B49 ou 
B37 a donne lieu a une frequence accrue de genotypes F 2 a forte teneur en hydroxamates. 
La variance genetique additive etait la composante la plus importante de la variance 
phenotypique et a produit des estimees de fheritabiiite allant de 0.64 a 0.79. Toutefois, 
Telement de dominance dans la variance etait considerablement plus eleve pour les 
croisements comportant BxBx que pour les croisements bxbx avec B49 ou B37. 



The cyclic hydroxamate 2,4-dihydroxy-7- 
methoxy-1, 4 benzoxazin-3-one (DIMBOA) 
was first reported in corn (Zea mays L.) and 
wheat (Triticum aestivum L.) (Wahlroos and 
Virtanen 1959). Subsequent studies have im- 

Can. J. Plant Sci. 61: 583-593 (July 1981) 



plicated the hydroxamates in resistance to 
stalk rot (BeMiller and Pappellis 1965), 
northern corn leaf blight (Anglade and Molot 
1967; Long et al. 1975), European corn borer 
(Klun et al. 1967), and corn leaf aphid (Long 
et al. 1977). 
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Monogenic and polygenic control have 
been hypothesized for accumulation of 
hydroxamates in corn. Hamilton (1964) 
studied the inheritance of hydroxamates in 
corn seedlings. The presence of D1MBOA or 
its glucoside was indicated by a blue-colored 
chelate formed when a root tip was crushed 
on filter paper, impregnated with 0.1 M 
FeCL 3 . He obtained a 3: 1 ratio in F 2 for high 
versus low hydroxamate seedlings. This gene 
conditioning production of DIMBOA was de- 
signated as BxBx (Couture et al. 1970). Klun 
et al. (1970) presented evidence for quantita- 
tive inheritance of DIMBOA. General com- 
bining ability accounted for 9 1 % of the varia- 
tion among hybrids in DIMBOA concentra- 
tion. 

Irie objectives of this study were to esti- 
mate gene number controlling concentration 
of hydroxamates in selected corn inbreds, to 
determine its heritability, and to estimate 
components of variance in five crosses. 

MATERIALS AND METHODS 

The following inbreds were used in this study: B49 
— high DIMBOA line, polygenic; B37 — low 
DIMBOA line, polygenic; BxBx — high 
DIMBOA line, monogenic; and bxbx — low 
DIMBOA line, monogenic. The first two inbreds 
were obtained from Dr. A. R. Hallauer, Iowa 
State University. The latter two lines were 
obtained originally, as genetic stocks from Dr. 
R. H. Hamilton, Pennsylvania State University 
and have since been selfed for six to seven genera- 
tions each. 

Five separate crosses were made: bxbx x BxBx; 
bxbx x B49; bxbx x B37; B49 x BxBx; and B37 
x BxBx. Each parent (P ( , P 2 ) and the correspond- 
ing F|, F 2 , BC, (F, x P,)andBC 2 (F 1 x P 2 )were 
analyzed for concentration of hydroxamates using 
the rapid procedure (Long et al. 1974). 

Approximately 50 plants of each parent and F, , 
100 plants of each backcross and 250 plants in each 
F 2 were analyzed for hydroxates. All plants were 
grown in the greenhouse at approximately 15° C 
(night) and 25°C (day) on a 16-h photoperiod to 
eliminate gross environmental effects. 

The rapid procedure consisted of removing 0.1- 
to 0.2-g sections of tissue from the lower stems of 
seedlings 36-41 cm in height (5th to 6th leaf stage). 
The samples were placed in plastic vials and frozen 



overnight. Subsequent thawing of the tissue 
allowed the B-glucosidase to hydrolyze the 
glucoside and release DIMBOA. Crude extracts 
were prepared by crushing the tissue with mortar 
and pestle in 1 .0 mL of a solution containing equal 
volumes of 95% ethanol and 0.1 N HC1. The 
extract was centrifuged and decanted into a cuvette 
to which was added 0.1 mL of 0.1 N FeCl^. 
Absorbance values were obtained and concentra- 
tions determined from a standard curve prepared 
from purified DIMBOA. 

The rapid procedure based upon the reaction to 
FeCl 3 is not specific for DIMBOA. Although 
DIMBOA is by far the major hydroxamate in Zea 
mays, it is likely that other related hydroxamates 
are present in trace quantities. For this reason, the 
data are expressed as total hydroxamate concentra- 
tion rather than as concentration of DIMBOA. 

Estimates of genetic effects and components of 
genetic variance were obtained using the proce- 
dure outlined by Warner (1952). The notation has 
been modified as indicated in the text. In this 
study, the environmental variance was obtained 
from the average of P ( , P 2 and F] variances since 
each was considered genetically uniform. The 
additive component of variance 12 V A , was esti- 
mated by multiplying V F; by two and subtracting 
from it the combined backcross variances, thus 
eliminating the dominance 14 V D and environ- 
mental components V E . By subtracting the additive 
and environmental variances from the total 
phenotypic variance of the F 2 , an estimate of the 
dominance variance was obtained . The deg ree of 
dominance was calculated as V4V D 2V A . Esti- 
mates of heritability (narrow sense) were calcu- 
lated according to the equation h 2 = V2V A ( l /i V A + 
Va v d + v E ). Estimates of gene number were calcu- 
lated according to Casters formula /V = £> 2 /[8(V r F , 
- V Fi )] where /V = gene number, D = the differ- 
ence between the parental means, and V F) and V F; 
= the phenotypic variances of the F| and F 2 , 
respectively. All values were transformed to 
logarithms in order to satisfy the assumption of 
scaling upon which the analysis is based. The 
frequency distributions are based on the log of the 
hydroxamate concentration whereas the range and 
mean values in Table 1 are given in terms of mg 
hydroxamates/g fresh weight. Additional details 
are given elsewhere (Long 1977). 

RESULTS AND DISCUSSION 

Ranges, means and variances for the parents 
and for five crosses among them are given in 
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Table J . The parents differed widely in con- 
centrations of hydroxamates, with bxbx 
(monogenic) and B49 (polygenic) repres- 
enting the extremes. 

In the cross bxbx X BxBx, the average 
concentrations of hydroxamates in F 2 and F 2 
exceeded the mid-parental value of 0.538 
mg, indicating the effects of dominance. The 
backcrosses produced concentrations nearly 
intermediate to the parental values. Variances 
for concentration of hydroxamates were con- 
siderably larger for segregating than for non- 
segregating generations. 

The frequency distributions (Fig. 1) indi- 
cated incomplete dominance for this trait in 
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this cross. Distribution in the F 2 appeared to 
be trimodal. When concentration of hydroxa- 
mates was grouped into discrete classes, a 
good fit was obtained to a 1 :2: 1 ratio in F 2 and 
to a 1:1 ratio in the backcrosses. F 3 data 
tended to support the hypothesis that a single, 
partially dominant gene conditions much of 
the hydroxamates in BxBx. 

The F, and F 2 means were fairly close to 
the mid-parental value of 0.595 mg in the 
cross bxbx x B49. Mean concentrations of 
hydroxamate in each backcross closely 
approximated the mid-parental values of 
0.369 and 0.884 mg for BC, and BC 2 , respec- 
tively. Frequency distributions for F 2 , BC, 



Table I. 



Ranges, means and variances for hydroxamate concentration in parents and progenies 
from five crosses in Zea maxs 



Population 



No. of plants 



Range 



Mean 
(mg/g fresh wt) 



Variancet 



Parents 

bxbx 

BxBx 

B37 

B49 
Crosses 

bxbx x BxBx 

F, 

F 2 

F, 

BC,t 
BC 2 $ 

bxbx x B49 

F, 

F 2 

BC, 

BC 2 

bxbx x B37 

F, 

F 2 

BC, 

BC 2 

B49 x BxBx 

F, 

F 2 

BC, 

BC 2 

B37 x BxBx 
•F| 
F 2 
BC, 
BC 2 



51 

48 
50 
49 



50 
241 
133 
113 
* 107 

50 
245 

96 
104 

50 
239 

97 
107 

50 
242 
102 
104 



50 
241 
103 
105 



0.022-O.I33 
0.753-1.172 
0.254-0.441 
0.843-1.271 



0.081 
0.996 
0.333 
1. 110 



tOriginal data were transformed to logarithms for calculation of variances 
?In each cross, BC, is backcross of F, to female parent (first parent listed), 



0.000149 
0.000606 
0.000241 
0.000406 



0.533-0.901 


0.663 


0.000448 


0.002-1.142 


0.556 


0.008601 


0.077-1.110 


0.552 


0.008320 


0.011-0.878 


0.389 


0.008292 


0.578-1.198 


0.812 


0.002080 


0.454-0.811 


0.658 


0.000360 


0.164-1.010 


0.545 


0.002314 


0.122-0.677 


0.382 


0.001850 


0.619-1.100 


0.864 


0.000830 


0.164-0.293 


0.232 


0.000230 


0.064-0.444 


0.201 


0.000682 


0.013-0.384 


0.149 


0.000431 


0.123-0.446 


0.288 


0.000464 


0.962-1.263 


1.069 


0.000365 


0.296-1.373 


0.884 


0.003660 


0.351-1.424 


0.913 


0.002976 


0.636-1.409 


0.931 


0.001910 


0.562-0.844 


0.729 


0.000234 


0.154-1.053 


0.552 


0.004212 


0.263-1. 1 1 1 


0.645 


0.003460 


0.500-1.206 


0.816 


0.001856 



and BC 2 is backcross to male parent. 
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Fig. 1. Frequency distributions for the parental, F„ F 2 , BQ and BC 2 populations for hydroxamate 
concentration from the cross bxbx X BxBx. 
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and BC 2 were continuous and seemed to fol- 
low a normal distribution curve (Fig. 2). It 
was not possible to divide concentration of 
hydroxamates in F 2 of bxbx x B49 into dis- 
crete classes as was the case for bxbx x 
BxBx. From these data it appears that concen- 
tration of hydroxamates is controlled in bxbx 
x B49 by several genes having a small but 
cumulative effect. It is hypothesized that B49 
does not possess any one gene with an effect 
comparable to BxBx. Estimates of gene num- 
ber using the Castle-Wright formula indi- 
cated that five loci condition concentration of 
hydroxamates in B49. 

Both parents had low concentration of 
hydroxamates in the bxbx x B37 cross. Con- 
centrations of hydroxamates in F, and F 2 
were close to the mid-parental value of 0.207 
mg. Backcross means were also close to the 
mid-parental values. Variances in F 2 , BC H 
and BC 2 were considerably larger than for 
parental and F| generations, but smaller than 
those for bxbx x B49. These lower variances 
in the bxbx x B37 cross probably reflected 
the level of hydroxamates in the parents. 

As in the bxbx x B49 cross, the frequency 
distributions for the F 2 , BQ and BC 2 popula- 
tions were continuous and normally distrib- 
uted (Fig. 3). The data indicated that content 
of hydroxamates in B37 is controlled by 
genes with little or no dominance. The esti- 
mate of gene number indicated that two loci 
condition concentration of hydroxamates in 
B37. 

In the cross B49 x BxBx, both parents 
exhibited high mean concentrations of 
hydroxamates. Mean concentration in the F t 
was nearly intermediate to that of the parents. 
The frequency distribution in F 2 was highly 
skewed to the right (Fig. 4). This departure 
from normality may reflect the influence of 
BxBx on the polygenic expression of 
hydroxamate concentration in B49. 

Backcrosses of the F, to either parent re- 
sulted in similar mean concentrations of 
hydroxamates. The distribution for BCj (F] 
x B49) clearly exhibited two modes, while 
in BC 2 (F] x BxBx) the two modes were less 
discernible. The bimodal nature of the back- 



cross distributions can be explained in part by 
considering the Bx gene alone. Since B49 
probably does not possess the Bx gene, back- 
crosses of the F| to the two parents would 
result in the expected genotypes BxBx:Bxbx 
for the Fj backcrossed to BxBx and BxBx: 
bxbx for the F| backcrossed to B49. The addi- 
tion of polygenes from B49 plus environmen- 
tal effects would tend to blur these distinct 
differences. 

In the B37 x BxBx cross, the relatively 
large deviation of the Fi mean from the mid- 
parental value of 0.664 mg indicates the 
effects of dominance. As in the previous 
cross, the F 2 frequency distribution appeared 
highly skewed to the right (Fig. 5). The effect 
of the Bx gene was shown by the departure of 
the F 2 distribution from normality. Similar 
trends were apparent in the two backcross 
populations. The frequency distribution for 
both BCj and BC 2 were bimodal. 

Components of variance for concentrations 
of hydroxamates in the five crosses are shown 
in Table 2. The phenotypic variances were 
considerably higher for crosses involving 
BxBx than for those with bxbx. This was prob- 
ably a reflection of the dominance effects of 
BxBx which resulted in a higher frequency of 
extreme genotypes in the F 2 populations. 
Similarly, the smaller variances shown by the 
crosses bxbx x B49 and bxbx x B37 reflect 
the normal distribution curves in which the 
' majority of individual plants were clustered 
about the means. 

The environmental variances for each 
cross were small in comparison to the total 
phenotypic variances. Genetic variance 
therefore accounted for the majority of the 
phenotypic variation for this trait. In all cros- 
ses the additive genetic variance was con- 
siderably higher than the dominance variance 
and accounted for the majority of the genetic 
variation. However, dominance variation 
was proportionally higher in the crosses in- 
volving BxBx. 

Estimates of heritability, degree of domi- 
nance and percent dominance for concentra- 
tion of hydroxamates in the five crosses are 
shown in Table 3. Estimates of heritability 
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Table 2. Components of variance for hydroxamate 
concentration in selected crosses among four maize 
inbreds 



Cross t 


Vp 




v D 




bxbx x BxBx 


8.601$ 


6.830 


1.370 


0.401 


bxbx XB49 


2.314 


1.950 


0.059 


0.305 


bxbx x B37 


0.682 


0.468 


0.007 


0.210 


B49 x BxBx 


3.660 


2.434 


0.767 


0.459 


B37 x BxBx 


4.212 


3.104 


0.748 


0.360 



tOriginal data transformed to logarithms for analysis. 
$A11 values x 10~ 3 . 

ranged from 0.64 to 0.79 and reflect the high 
proportion of additive genetic variation to 
total phenotypic variation. Based upon these 
estimates, selection for concentration of 
hydroxamates by nearly any breeding method 
should be effective in these crosses. 

It is apparent from Table 3 that crosses 
involving BxBx exhibited a much higher de- 
gree of dominance than in the crosses bxbx X 
B49 and bxbx x B37. it is hypothesized that 
the inbreds B49 and B37 do not possess any 
gene or genes with dominance effects compa- 
rable to BxBx. 

Similarly, percent dominance values were 
comparably higher for crosses involving 
BxBx than for crosses with bxbx. This is a 
reflection of the higher proportion of domi- 
nance variation to total phenotypic variation 
shown by the BxBx crosses. 

These data support the hypothesis that con- 
centration of hydroxamates in BxBx is con- 
ditioned primarily by a major, partially 
dominant gene, while the concentration in 
B37 and B49 appears to be conditioned by 



Table 3. Estimates of heritability, degree of domi- 
nance, and percent dominance for hydroxamate concen- 
tration in selected crosses among four maize inbreds 









Percent 








dominance to 






Degree of 


the total 


Crosst 


Heritability 


dominance 


variation 


bxbx x BxBx 


0.79 


0.63 


16.71 


bxbx x B49 


0.64 


0.25 


2.95 


bxbx x B37 


0.68 


0.17 


1.47 


B49 x BxBx 


0.67 


0.79 


23.96 


B37 x BxBx 


0.74 


0.69 


19.42 



tOriginal data transformed to logarithms for analysis. 



genes with smaller effects. A recurrent selec- 
tion program is in progress to increase con- 
centration of hydroxamates in corn utilizing 
both monogenic and polygenic factors. 
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Claim 66 
Mannitol 



Stress Protection of Transgenic Tobacco by 
Production of the Osmolyte Mannitol 

Mitchell C. Tarczynski,* Richard G. Jensen, Hans J. Bohnert 

The accumulation of sugar alcohols and other low molecular weight metabolites such as 
proline and glycine-betaine is a widespread response that may protect against environ- 
mental stress that occurs in a diverse range of organisms. Transgenic tobacco plants that 
synthesize and accumulate the sugar alcohol mannitol were engineered by introduction of 
a bacterial gene that encodes mannitol 1 -phosphate dehydrogenase. Growth of plants from 
control and mannitol-containing lines in the absence and presence of added sodium 
chloride was analyzed. Plants containing mannitol had an increased ability to tolerate high 
salinity. 



Drought, low temperature, and high salin- 
ity are environmental factors that may dra- 
matically limit plant growth and crop pro- 
ductivity (I). In response to these abiotic 
stresses, which all disturb the intracellular 
water balance, many plants and bacteria 
synthesize and accumulate osmotically ac- 
tive, low molecular weight compounds such 
as sugar alcohols, proline, and glycine- 
betaine (2, 3). Collectively, these com- 
pounds have been referred to as osmolytes, 
osmoprotectants, or compatible solutes. Al- 
though their exact function in plants is 
unknown, numerous studies [(2) and refer- 
ences therein] suggest these osmolytes may 
protect the plant from abiotic stress. For 
example, osmolytes accumulate in plant 
cells in response to water or salinity stress 
and are subsequently degraded or lost after 
stress relief (2). Other studies indicate a 
macromolecular protective effect: in vitro 
incubation of osmolytes with protein ex- 
tracts from plants often alleviates the ad- 
verse effects of electrolytes and temperature 
stress on enzymatic activity (4-7) . Howev- 
er, several "nonprotective" roles for many 
of these compounds have also been suggest- 
ed (8—1 1), such as storage products during 
stress. Additionally, accumulation of these 
compounds may be the result of a stress- 
induced metabolic impairment (12). 

An example of such an osmolyte is the 
sugar alcohol mannitol, which occurs wide- 
ly in plants and animals (2, 3). We previ- 
ously introduced a metabolic branch point 
into transgenic tobacco by transformation 
with a 35S mtlD gene construction (13). 
Mannitol accumulation in leaves and roots 
of these transformed plants was estimated at 
a maximum concentration of 100 mM (13), 
if a cytosolic location is assumed. Here we 
determine whether the tobacco plant, 
which does not normally produce and ac- 
cumulate mannitol, is protected from salin- 
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ity stress by this introduced osmolyte. 

The growth of 6-week-old plants in the 
absence and presence of added NaCl was 
assessed. We evaluated plant growth by 
measuring the percent of change in height 
and in fresh weight (14). Initial heights and 
weights for all plants of each experiment 
were measured just before NaCl addition 
(J5). For nonstressed plants, final measure- 
ments were taken at the onset of flowering 
(after 9 to 10 days of culture). Final heights 
and fresh weights for stressed plants were 
measured after 14 and 30 days of culture 
because these plants did not flower. No 
differences were observed between control 
and mannitol-containing plants after 10 
days of NaCl treatment. Plants were eval- 
uated from five different lines: the two 
controls, SRI, wild-type tobacco cv. SRI 
and SRI 1 ™, kanamycin-resistant tobacco, 
which was transformed with a gene con- 
struction identical to that used for mannitol 
expression (13) except for the absence of 
the mtlD gene, and three lines of manitol- 
containing transformants, 1-mtl, 2-mtl, 



MF ... 



and 3-mtl (16). Plants were hydroponically 
cultured under controlled conditions (17). 
All plants from mtl lines contained manni- 
tol, whereas mannitol was not detected in 
plants from control lines (13). 

When cultured without added NaCl, 
both the control and mtl plants increased 
about six- to sevenfold in height and more 
than twofold in fresh weight over the 9- to 
10-day interval before flowers appeared 
(Table 1). Statistical analyses of percent 
change in height and fresh weight [single 
degree of freedom contrasts from a com- 
bined analysis of variance (ANOVA) of 
two experiments] indicated no significant 
differences in these variables between the 
two control lines or between the control 
lines and the three mtl lines (P = 0.17 to 
0.63). Thus, neither the transformation 
protocol nor mannitol accumulation influ- 
enced plant growth in the absence of excess 
NaCl. 

To determine the NaCl concentrations 
at which mannitol may affect growth, we 
conducted pilot studies with 6-week-old 
plants exposed to various concentrations of 
NaCl (100 to 300 mM) for 30 days. Primar- 
ily on the basis of visual analysis, growth of 
plants containing mannitol could be distin- 



Table 1. Growth of tobacco plants in the ab- 
sence of added NaCl. All data from two sepa- 
rate experiments are shown. Seeds were ger- 
minated and plants were cultured in an equal 
mixture of vermiculite, potting soil, and sand 
under greenhouse conditions for 3 weeks. In- 
tact plants were then transferred to a hydropon- 
ics system and cultured in a growth room for 
-3 weeks (±1 day) (17). Height of the aerial 
portion (in centimeters) and total fresh weight of 
individual plants (in grams) were measured 
(initial measurement, /). These two traits were 
measured again for the same plants just before 
flowering, 9 to 10 days later (final measure- 
ment, F). Roots were blotted dry before weight 
measurements were taken. Percent change 
was calculated as [{F/l) - 1] 100. Lines evalu- 
ated (76): SR1, wild type; SR1 km , kanamycin- 
resistant; and 1-mtl, 2-mtl, and 3-mtl (three 
individual transformants), kanamycin-resistant 
and expressing 35S mtlD {13). Evaluations 
were performed blindly. Data are mean values 
± SEM; n, total number of plants. 



Fig. 1. Control and mannitol-containing tobac- 
co plants after 30 days of culture in the pres- 
ence of added NaCl (250 mM). Plants were 
cultured as described (17). Left plant, wild type 
(SR1); right plant, 35S mtlD transformant (1- 
mtl). 
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Plant 
line 



Height 
(% change) 



Fresh weight 
(% change) 



Experiment 1 
SR1 748.8 ± 1 10.9 301 .5 ± 29.4 8 
SRI 1 ™ 718.1 ± 95.6 248.0 ±20.1 6 



1- mtl 791 .4 ± 78.6 

2- mtl 871.9 ± 91.4 

3- mtl 615.7 ± 36.2 



330.8 ± 26.3 8 
276.1 ±31.6 6 
270.6 ±17.6 8 



Experiment 2 

SR1 577.3 ± 26.1 199.7 ± 5.7 8 

SRI*" 1 546.8 ± 33.2 202.7 ± 6.5 8 

1- mtl 560.8 ± 48.1 200.5 ±13.1 8 

2- mtl 643.9 ± 29.5 250.3 ± 9.8 8 

3- mtl 582.3 ± 40.3 203.0 ±12.4 8 



guished from that of control plants at high 
salinity (for example, 250 mM). Thus, we 
pursued this initial observation by assessing 
the growth of plants cultured under the 
extreme, or shock, condition of 250 mM 
NaCL 

After 14 days of treatment with 250 mM 
NaCl, plant growth from both control and 
mtl lines was severely inhibited (Table 2). 
Stem elongation of salt-treated plants was 
generally less than 10% of that for the 




Fig. 2. Roots from control and mannitol-contain- 
ing tobacco plants after 30 days of culture in 
the absence and presence of added NaCl (250 
mM). Plants were cultured as described (17). 
From left to right: wild-type plant (SR1) cultured 
without added NaCl, 35 S mtID transformant 
(1-mtl) cultured without added NaCl, SR1 plant 
cultured in 250 mM NaCl, and 1-mtl plant 
cultured in 250 mM NaCl. 



nonstressed plants after 9 to 10 days of 
culture. Additionally, plants typically did 
not flower after 14 days of stress. The fresh 
weight of plants from most lines decreased 
because of necrosis and dehydration of low- 
er leaves. A comparison of the different 
lines for each experiment shows that after 
14 days of stress there were no clear differ- 
ences between the growth of plants from 
control and mtl lines: 

After 30 days of exposure to high salin- 
ity, however, this difference could be dis- 
tinguished, with plants that contained 
mannitol showing increased tolerance (Ta- 
ble 2). In both experiments, plants from 
mtl lines had decreased weight loss relative 
to that for plants of control lines. Addition- 
ally, mtl transformants increased in height 
a mean of 80% (range of 53 to 121%) 
compared with their height at 14 days. 
Conversely, control plants increased in 
height only a mean of 22% (range of 3 to 
32%) over the same interval. Statistical 
analyses (single degree of freedom contrasts 
from a combined ANOVA for both exper- 
iments) documented significant differences 
between control and mtl lines in mean 
height (P = 0.0006) and fresh weight (P = 
0.0001). No such differences were observed 
between the two control lines. Thus, on 
the basis of height and fresh weight, plants 
that contained mannitol showed increased 
tolerance to high salinity relative to control 
plants. Variability in results (for example, 
with 3 -mtl plants) may reflect subtle differ- 
ences in plant development among the 
lines just before NaCl treatment or in the 
experimental conditions (18). 
^ Control and transgenic plants also ap- 
peared different after 30 days of stress with 
250 mM NaCl (Figs. 1 and 2 and Table 3)^ 
With all plants, roots exposed to 250 mM 
NaCl were apparently unable to survive 



treatment after about 10 days and appeared 
dark brown in color (salt-stressed control 
roots, Fig. 2). However* plants that con- 
tained mannitol often produced new roots 
and then new leaves (Figs. 1 and 2 and 
Table 3). New root and leaf growth oc- 
curred in a mean of 75% (range of 60 to 
88%) of plants from mtl lines, whereas such 
growth occurred in only a mean of 33% 
(range of 13 to 50%) of plants from control 
lines (Table 3). These new roots typically 
appeared after about 14 days of NaCl treat- 
ment. Flowering (Fig. 1) was observed in 
73% of plants that generated new roots and 
leaves. Although it is not clear how intra- 
cellular mannitol accumulation may lead to 
new root growth, the osmolyte may affect 
processes at the cellular level that are in- 
volved in formation of new roots* 

The increased production of new roots 
and the subsequent emergence of new 
leaves and then flowers indicate that the 
relative increases in height and fresh weight 
of plants containing mannitol are due to 
new growth rather than to a reallocation of 
resources. Additional experiments with dry 
weight analyses (which were not carried out 
in our time course studies because of their 
destructive nature) should confirm this ob- 
servation. 

Sugar alcohols may contribute to toler- 
ance at the cellular level by adjustment of 
the cytosolic osmotic potential when the 
concentration of electrolytes is lower in the 
cytosol than in the vacuole (2). These 
compounds may also protect membranes 
and proteins in the presence of high con- 
centrations of electrolytes (4, 19). An as- 
sumption in both mechanisms is that the 
osmolytes are localized in the cytosol. Our 
data do not distinguish between these 
mechanisms. Depending on the cell type, 
its normal responses to stress, and environ- 



Table 3. Visual assessment of new root and leaf 
growth in tobacco plants after 30 days of expo- 
sure to 250 mM NaCl. All data from two sepa- 
rate experiments are shown. Evaluations were 
performed blindly. Procedures and abbrevia- 
tions as in Table 1 . 



Plant 
line 


Plants with new 
roots and leaves 


n 




Experiment 1 




SR1 


' 1 


8 




2 


8 


1-mtl 


4 


6 


2-mtl 


6 


8 


3-mtl 


7 


8 




Experiment 2 




SR1 


5 


10 


SRI 1 ™ 


4 


10 


1-mtl 


8 


10 


2-mtl 


8 


10 


3-mtl 


6 


10 



Table 2. Growth of tobacco plants in the presence of 250 mM NaCl. All data from two separate 
experiments are shown. Plant height and fresh weight were measured from — 6-week-old plants 
(±1 day) just before salt addition (added all at once) and then at 14 and 30 days after addition. 
Evaluations were performed blindly. Procedures and abbreviations as in Table 1 . Data are mean 
values ± SEM. 



Height (% change) Fresh weight (% change) 



14 days 30 days 14 days 30 days 



Experiment 1 



SR1 


45.9 ± 


3.9 


60.3 




9.5 


-14.0 ± 


5.8 


-48.4 ± 11.1 


8 


SRI*™ 


66.9 ± 


8.5 


68.7 


± 


10.8 


-9.5 ± 


9.7 


-44.7 ± 16.8 


8 


1-mtl 


59.4 ± 


4.2 


113.4 


± 


29.8 


-1.7 ± 


12.2 


-7.7 ± 24.9 


6 


2-mtl 


54.4 ± 


9.6 


83.3 




14.9 


-1.6 ± 


4.9 


-14.8 ± 10.4 


8 


3-mtl 


74.5 ± 


10.2 


129.3 


± 


12.0 


+ 16.3 ± 


5.2 


+27.6 ± 12.0 


8 












Experiment 2 








SR1 


23.1 ± 


2.8 


28.0 




3.0 


-9.1 ± 


5.7 


-34.0 ± 8.2 


10 


SRl km 


36.5 ± 


3.5 


48.2 




5.4 


-7.3 ± 


5.6 


-36.2 ± 7.5 


10 


1-mtl 


29.4 ± 


3.1 


55.1 


± 


7:1 


-6.5 ± 


3.6 


-18.4 ± 10.4 


10 


2-mtl 


19.2 ± 


2.8 


42.5 


± 


7.3 


-6.9 ± 


5.6 


-14.7 ± 9.3 


10 


3-mtl 


29.0 ± 


2.3 


44.8 




3.4 


-4.3 ± 


3.4 


-14.8 ± 9.8 


10 
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mental conditions, a contribution in tobac- 
co cells of 100 mM mannitol may be con- 
sistent with both mechanisms. However, 
uniform distribution of mannitol at the 
subcellular level would likely preclude cy- 
tosolic osmoregulation. Determination of 
both the subcellular location of mannitol 
and the concentrations of the sugar alcohol 
in tissues will help to distinguish between 
these two mechanisms. 

Alternatively, mannitol may metaboli- 
cally predispose tobacco cells to stress tol- 
erance. Thus, the cellular accumulation of 
mannitol, which is normally foreign in 
tobacco cells, may increase the response of 
metabolic pathways normally involved in 
stress tolerance, which thereby allow cells 
to withstand stress. 

On the basis of pilot experiments, we 
studied the growth response of plants of a 
defined age that were exposed to the ex- 
treme, or shock, condition of 250 mM 
NaCl. Additional experiments that vary 
plant age (development) and NaCl concen- 
tration (added all at once or incremental- 
ly), as well as experiments that investigate 
the effects of other environmental stresses 
such as drought and cold, will help to 
explain the function of mannitol in higher 
plants. We have demonstrated that for 
tobacco, the presence of mannitol in vivo 
protects against high salinity. Sugar alcohol 
accumulation may also enhance stress tol- 
erance in other plants. 
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change, for each line. Plants not receiving NaCl: 
SR1, 6.1 ± 0.4/21.6 ± 1.1, 7.3 ± 0.5/26.9 ± 0,9; 
SRI 1 ™, 6.7 * 0.5/27.2 ± 1.2, 7.6 ± 0.3/27.6 ± 0.9; 

1- mtl, 6.4 ± 0.6/23.7 ± 1.5, 7.3 ± 0.6/26.0 ±1.1; 

2- mtl, 5.9 ± 0.3/27.3 ± 1.1, 6.4 ± 0.4/24.9 ± 0.8; 

3- mtt, 6.3 ± 0.3/21.9 ± 1.9, 7.1 ± 0.5/27.1 ± 1.5. 
Plants subsequently receiving NaCl treatment: 
SR1, 7.1 ± 0.4/27.9 ± 2.0, 7.4 ± 0.2/26.1 ± 1.3; 
SRI*" 1 , 9.4 ± 0.4/30.8 ± 1.1,7.1 ± 0.3/25.5 ± 1 .0; 

1- mtl, 7.4 ± 0.4/26.4 ± 1.4, 7.2 + 0.3/28.1 ± 0.7; 

2- mtl, 6.8 ± 0.3/27.5 ± 1.4, 6.6 ± 0.3/27.4 ± 0.7; 

3- mtl, 7.3 ± 0.4/26.3 ± 1.6. 7.1 ± 0.2/27.9 ± 0.8. 
Values are mean ± SEM; for number of plants 
evaluated; see Tables 1 and 2. 

16. For lines that express 35S mtID, 30 primary 
transformants were chosen randomly and self- 
pollinated, and the progeny were selected for 
single-locus inserts on the basis of expression of 
kanamycin resistance. Of the lines that contain 
single locus inserts, three were randomly cho- 
sen (1-mtl, 2-mtl, 3-mtl), self-pollinated again, 
and determined to be homozygous on the basis 
of expression of kanamycin resistance, this pro- 
cedure was also used for control SRI 1 ™, except 
that five primary transformants were chosen. 
Lines analyzed were SR1, wild type; SRI 1 ™, 
kanamycin-reststant; and 1-mtl, 2-mtl, and 3-mtl, 
three different kanamycin-resistant lines that ex- 
press 35S mtID. 

17. The hydroponics system consisted of two genet- 
ically identical plants cultured in containers with 
half-strength Hoagland's nutrient solution (5 li- 



Active sites of proteins are typically com- 
posed of recognition elements guided into 
proximity and appropriate orientation by 
the native protein fold. Individual recogni- 
tion elements may be remote in primary 
structure or may be located on different 
polypeptide chains in multisubunit pro- 
teins. With site-directed mutagenesis, ami- 
no acids that constitute individual recogni- 
tion elements can be changed without af- 
fecting the overall orientation of the recog- 
nition domain. Yet present technology does 
not allow predictable and routine changes in 
the orientation of the domains themselves. 
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ters). (For Figs. 1 and 2, plants from SR1 and 1-mtl 
were cultured together in 250 mM NaCl in hydro- 
ponic containers.) The nutrient solution was 
changed every 2 weeks. Plants were inserted into 
the nutrient solution through two holes in the 
container lid. The growth room housed eight to ten 
plots of five randomly placed containers (one for 
each line). For each experiment, halt of the plots 
were cultured in the absence of added NaCl and 
the other half were cultured in the presence of 250 
mM NaCl. To minimize variability in growth and 
development of plants before stress, all plants (for 
the data reported in Tables 1 to 3) were approx- 
imately the same age (6 weeks) and were select- 
ed from larger groups for inclusion in each exper- 
iment for their similar heights and weights. For the 
two experiments, plants were visually identical 
before NaCl treatment. To eliminate potential ex- 
perimenter bias, evaluations were performed 
blindly. Growth-room conditions were 18° and 
25*C, night and day temperatures, respectively; 
photoperiod, 12 hours light (500 |iE rrr 2 s~ 1 ). 

18. R. L Mott and F. C. Steward, Ann. Bot. 36, 621 
(1972); ibid., p. 641; ibid., p. 655; ibid., p. 915. 

19. B. Schobert, J. Theor. Biol. 68, 17 (1977). 
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The transcriptional activator protein 
GCN4 is one of a large family of DNA- 
binding proteins identified by a bZIP struc- 
tural motif (1); this motif contains a DNA 
contact domain characterized by conserved 
basic and hydrophobic residues (b domain) , 
and a dimerization domain identified by a 
heptad repeat of leucine residues (ZIP do- 
main) (2, 3). The two domains are separat- 
ed by a six-amino acid linker whose length, 
but not sequence is conserved across bZIP 
families (J). Previous work has demonstrat- 
ed that the active DNA-binding entity is 
generated when the ZIP domains of two 
protein monomers assemble (4) into a par- 
allel coiled coil (5, 6). The scissors grip (I) 
and induced helical fork (7) models propose 
that the coiled coil, the natural dimeriza- 



Altered Specificity of DNA-Binding Proteins with 
Transition Metal Dimerization Domains 

Bernard Cuenoud and Alanna Schepartz* 

The bZIP motif is characterized by a leucine zipper domain that mediates dimerization and 
a basic domain that contacts DNA. A series of transition metal dimerization domains were 
used to alter systematically the relative orientation of basic domain peptides. Both the 
affinity and the specificity of the peptide-DNA interaction depend on domain orientation. 
These results indicate that the precise configuration linking the domains is important; 
dimerization is not always sufficient for DNA binding. This approach to studying the effect 
of orientation on protein function complements mutagenesis and could be used in many 
systems. 
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Molecular Cloning and Physical Mapping of the otsBA Genes, 
Which Encode the Osmoregulatory Trehalose Pathway of 
Escherichia coli: Evidence that Transcription Is 
Activated by KatF (AppR) 

INGA KAASEN. PAL FALKENBERG. OLAF B. STYRVOLD, and ARNE R. STROM* 
The Norwegian College of Fishery Science, University o/Troms0. N-9000 Tromse. Norway 
Received 16 July 1991/Accepted 26 November 1991 

It has been shown previously that the otsA and otsB mutations block osmoregulatory trehalose synthesis at 
Escherichia coli. We report that the transcription of these osmoreguJated ots genes is dependent on Katf 
(AppR), a putative sigma factor for certain stationary phase* and starvation-induced genes. The transcription 
of the osrooregulated bet and proU genes was not kaiF dependent. Our genetic analyses showed that katF earns 
an amber mutation In £. coir K-12 and many of its derivatives bat that katF has reverted to an active form m 
the much-used strain MC4100. This amber mutation In kaiF leads to strain variations in trehalose synthesis and 
other kaiF -dependent tactions of £. coli. We have performed a molecular cloning of the oisBA genes, and we 
present evidence that they constitute an operoo encoding trehalose-6-phosphate phosphatase and trehalose-** 
phosphate synthase. A doning and restriction sit* analysis, performed by comparing the cloned fragments with 
the known physical map of the £. coli chromosome, revealed that the otsBA genes are situated on a 2.9-tt 
HindUl fragment located 8 to 11 kb dock wise of tar (41.6 min). 



Trehalose, a nonreducing dtsaccharide of glucose, is a 
stress metabolite in various organisms (29). Saccharomyces 
cerevisiae accumulates trehalose when exposed to an ele- 
vated temperature of growth (3. 24) or to hazardous chemical 
agents such as ethanol, copper sulfate, or hydrogen peroxide 
(3). Rhizobia accumulate trehalose when stressed with low- 
oxygen (e.g.. 1%) tension, regardless of the composition of 
the growth medium (23). Many phototrophic and heterotro- 
phic bacteria, including Escherichia coli. accumulate 
trehalose in response to osmotic stress (18. 31. 43. 50). 
Trehalose is shown to preserve the function and integrity of 
biological membranes exposed to conditions of low water 
activity (14) and to confer desiccation tolerance to yeasts 
(24), to spores of Streptomyces sp. (36), and to nematodes 
(14); frost tolerance to insects (2) and yeasts (22); and 
osmotic tolerance to £. coli (19). In yeasts, trehalose accu- 
mulation during growth in liquid culture coincides with an 
increased plating efficiency on agar plates of low water 
activity (34). 

In £. coli. the osmoregulatory trehalose pathway consists 
of a trehalose-6-phosphate synthase which converts UDP- 
glucose and glucose -6-phosphaie to trehalose-6-phosphate 
and a phosphatase which dephosphorylates this metabolic 
intermediate (reference 19 and this study). Two insertion 
mutations, named ots A and otsB. which block the synthesis 
of the synthase, have previously been mapped to 42 min. but 
the trehalose -6-phosphate phosphatase activity of these mu- 
tants was not reported (19). However, a point mutation 
named otsP which causes accumulation of trehalose -6-phos- 
phate in stressed cells, presumably because of a defective 
phosphatase, was mapped near otsA (27). This mutation 
appears to be allelic with otsB (reference 27 and this study). 

Trehalose accumulation in stressed cells of £. coli is 
regulated at several levels. Experiments with lac fusions 
have shown that the otsA and otsB genes are transcription- 
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ally activate by osmotic stress, and biochemical data have 
shown that the synthase is activated by potassium ghrtamate 
(19). At the cellular level, trehalose accumulation is regu- 
lated by a futile cycle. Trehalose is overproduced in the 
cytoplasm, and the excess is excreted and split to gbcose by 
a periplasmic trehalase (TreA) and then rcutifaed (52). 
Similarly to the ots genes, the treA gene is transcraxjooally 
activated by osmotic stress but not by external trehalose (8). 
treA maps at 26 min (8). Rod et al. (47) reported that £. coli 
K-12 and most of its derivatives carry an amber mutation 
that leads to decreased accumulation of trehalose m osmot- 
ically stressed cells. Styrvoid and Strom (32) showed that 
this amber mutation was not in the otsA and otxB genes 
themselves but that it caused decreased transcription of 
these genes. 

Trehalose can serve as the sole source of energy and 
carbon in £. coli. At high osmolality, trehalose affixation 
depends on the TreA activity (20). But at low osaoiarity, 
trehalose induces a transport protein. EH Tr * (TreBl and a 
pathway consisting of at least a catabolic trehaiose-6-phos- 
phate phosphatase (TrcE) and an amylotrehalase (TreC). 
treB. treC. and presumably treE map at 96.5 min (9). Their 
internal inducer is trehaJose-6- phosphate; therefore, these 
genes are not expressed under osmotic stress when the 
osmoregulatory trehalose -6-phosphate phosphatase is oper- 
ative (27). A mutation which seems to influence expression 
of otsA. otsP (otsB), and treC. but not treB. has been 
mapped to 84 min, but the nature of this gene remains 
unclear (27). 

In this investigation, we have cloned the otsBA genes. We 
report that they constitute an operon which encodes the 
phosphatase and the synthase, respectively, of the osmoreg- 
ulatory trehalose pathway. The amber mutation which influ- 
ences otsBA expression was mapped at 59 min and shown to 
be in katF. which encodes a putative sigma factor (39) for 
starvation- and stationary phase-induced genes (30. 37). 
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MATERIALS AM) METHODS 

Bacterial strains and growti conditions. The bacterial 
strains used are listed in Table 1. The growth media (medium 
63 (38). low-osmolarity minima] medium (LOM) [11J. and 
LB medium |38() and the growth conditions used were 
exactly as described previously by us (52). 

Genetic procedures and strain constructions. Conjugation 
by mating on agar plates and PI transduction were per- 
formed as described by Miller (38). When transferring Tn/0 
insertions and IacZ fusions generated with A p/ocMu. we 
selected for resistance against tetracycline and kanamycin. 
respectively. Tn/0 was deleted by using the positive selec- 
tion method described by Bochneret aJ. (7). The presence of 
a recA mutation was tested by determining UV sensitivity as 
described previously (35). For strains carrying otsAAacZ or 
otsB-lacZ operon fusions, the otsX (katF) genotype was 
routinely checked by scoring their Lac phenotype by growth 
on medium 63 with lactose as an energy source (see below). 
The pedigrees of the strains constructed in this study are 
given in Table 1. Further description of strain construction is 
given in this section and in Results. 

A culture of MC4100 carrying a random selection of Tn/0 
insertions was prepared by infecting MC4100 with the phage 
K NK561 essentially as described previously (49). After the 
transposition, the cells were spread on 50 agar plates con- 
taining LB medium with 25 ug of tetracycline per ml and 4 
mM sodium citrate. Approximately 50.000 Tc f colonies 
representing individual transposition events were pooled and 
used for production of PI lysate. 

We verified that our strain of MC4100 did not carry an 
amber suppressor in the following way. A Tn/0 insertion 
was introduced near an amber mutation by transducing 
BL50 [metF(Am)] with a lysate prepared from GDI (zih:: 
Tn/0) by selecting for Tc r and screening for Met". A PI 
lysate of a resulting strain, PF11 lmetF(Aa\) z/A::Tn/0J. was 
then used to transduce MC4100 to Tc f . Of 50 Tc f transduc- 
tants tested. 19 were Met", snowing that the metFiAm) 
mutation was not suppressed in MC4100. 

FF1005 was constructed by conjugational crossing of 
JC10240 with MC4100 by selecting for jW::Tn/0 (Tel and 
screening for recA (UV 1 ). By also checking that the trans- 
ductant gave an osmotically tolerant phenotype on agar 
plates with medium 63 with 0 J M NaCI added, we ensured 
thai FF1005 carried the onJT^ioo aU*** recipient 
and not the otsX allele of the donor. 

In order to verify that LCB107 carries a supE mutation 
and to construct a suppressor-free LCB107 derivative, a PI 
lysate prepared from BL50 [metB* metfiAm)} was first used 
to transduce the LCB107-derived IK5 (meiBt metF* supE?) 
to Met*. As metB and metF are neighbor genes on the 
chromosome (4), Met* transductants that carry an amber 
suppressor could have either a metB* metFiAm) or a metB* 
metF* genotype. To investigate their genotype, four trans- 
ductants from this cross were transduced to Tc r with a PI 
lysate prepared from JW1071 (sup 0 z£>/::Tn/0), which carries 
a Tn/0 marker within cotransductional distance of gtnV, i.e., 
the native jwpEgene (4). The progeny from each of the four 
crosses displayed both Met* (30%) and Met" (70%) geno- 
types. This showed that their parental strains were supE 
metFi Am). One of the Met* transductants of strain IK5 was 
named 1K58. Strain IK59 was a sup 0 metFiAm) derivative of 
IK58. 

To introduce the ofjJr MC4 joo allele into an LCB107 
background, we first transduced the LCB107 derivative 
IK58 with a PI lysate prepared from IK68 (cysC mutS:: 



Tn/0). selecting for Tc' and screening for Cys". The result- 
ing strain. IK60. was transduced with a PI lysate prepared 
from FF1112 (cysC* mutS* otsX* MC4XOO ). selecting for 
Cys* and screening for Tc\ As otsX is located between the 
cxsC and the mutS genes (see below), a Cys* Tc* transduc- 
tant such as IK61 should carry the otsX^ MC4Xoa allele. 

General recombinant DNA procedures and doning of otsA 
and oaB. Transformations were done by the method oi 
Chung et al. (13) or a standard CaCU method. Plasmid 
isolation, restriction cleavage of DNA. and ligation were 
done by standard methods. 

Restriction maps of the chromosomally derived parts of 
the constructed plasmids are shown in Fig. 2. and the 
plasmids are listed in Table 1. A gene library of strain CSH7 
was constructed in the pBR322-derived cosmid vector cos4 
(46) as described by Andrescn et al. (1). Plasmids were 
introduced into the osmotically sensitive strain FF4169 
(otsA::TnI0). and osmotically tolerant, trehaiose-producing 
clones were selected by plating on medium 63 with 0.45 M 
NaCI added. The plasmid pFFl, selected in this way, 
contained a 43-kb chromosomal fragment (see Fig. 2). The 
left end of the fragment is proximal to the Sail site of the 
vector. The plasmid pFFl was maximally shortened by Sail 
to give pFFlOl and by BamHl to give pFF102. A 2.9-kb 
///VidJII fragment from pFFlOl was subcloned into the 
///ndlll site of pACYC184 (12) to give the subclone pFF106. 
The left //indUI site of the chromosomal insert of pFF106 
(see Fig. 2) is proximal to the Clal site of the vector. HindlU 
shortening of pFF102 yielded pFF109. In the construction of 
pFFl 14. plasmid pFF106 was shortened by EcoRV so that 
0.8 kb of the insert and 0.2 kb of the vector were deleted. 

TrehaJase-6-phosphat* synthase and trehaJos*4-pbosphate 
phosphatase activities. The trehalose -6-phosphate synthase 
and phosphatase enzyme activities were measured in vitro 
by using cells permeabilized with toluene (19). In the syn- 
thase assay, trehalose-6-phosphate formed from UDP-glu- 
cose and giucosc-6-phosphate was dephosphorylated by a 
treatment with alkaline phosphatase (19. 52) and the 
trehalose formed was iruncthylsilylated and determined by 
gas chromatography exactly as described previously (19). 
The standard reaction mixture for determination of the 
phosphatase activity contained (in a 150-uJ volume) the 
following: 1.5 uinol of trehalosc-6-phosphate, 5 iimol of 
Tris-hydrochloride (pH 7.4), 0.4 umol of MgCU. and 125 to 
500 ^g of cell protein. Tbe reaction mixture was incubated 
up to 12 min at 37°C, and the reaction was terminated by 
heating for S min in a boiling-water bath. Sucrose (0.25 junol 
in a 25-ui volume) was then added as an internal standard, 
and denatured protein was removed by centrifugation. For 
desalting, a sample of 150 uJ was applied to a column (0.5 by 
2 cm) packed with equal amounts of Dowex 50X4-200 in FT 
form and Dowex 1X8-400 in formate form. Free sugars were 
washed through the column with 1 ml of water, and the 
eluate was fireeze-dried. Gas chromatographic determination 
of trimethyisilylated trehalose was then performed exactly 
as described previously (19). One unit of synthase or phos- 
phatase activity equals 1 nmol of trehalose produced per min 
at37°C 

Other aaethods and special chemicals. Cell protein was 
determined by the biuret method as modified for whole 
bacterial cells (21). The p-galactosidase activity of IacZ 
fusion mutants was determined quantitatively by using cells 
treated with chloroform and sodium dodecyl sulfate as 
described previously (19. 38). One unit of ^-galactosidase 
activity equals 1 nmol of <wiitrophenol produced per min at 
28°C. Catalase activity and glycogen accumulation were 
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TABLE I. t\ n>ti strain*, bacteriophage?, and plasmids 



Strain, 
bactcnophift. 
or plasmid 



Description" 



Con \t men or tourcc* 



Strains 

AT2427 ^ Hfr tysJ43 retAi //»•/ tpoTI X 

BL50 F* rhr-HAm) leuB6 hts~t metFI59(Am) tda-50 rpxL/36 thi*t aru-M mil* I rv/*5 tsx-78 
taiY 

CSH7. F ~tacY( Am) rpsL thi 

OC906 F* retAI spoTl metBI fucr.TnlO cvsiSJt Ami otsX 

DC931 W1485 vYsiSM Ami locZI25iAm) 

DC943 W1485 cysl53i Am) tocZI25<Am) supE42 

ES1481 F* /<ifZ5J(Am» mutS2IS::TnlO ih\AJ6 rhu-5 metBI deoC2 \SirrnD-rrnE\n X * uttjir 

FF1005 MC4100 recA$6 srIM&.TnIO otsX* MC4IOD 

FF1 1 12 MC4100 QiotsBtacM «aJT Mt4too 

FF1608 MC4100 <P<otsB-lacZr9 MC4lfl0 

FF2032 MC4100 <tHotsA~tacZi7 «M* MC4lO0 

FF4012 MC4100 otsAI::TnlO *iotsB-lacZV9 otsX* HC4l(n 

FF4019 MC4100 ire I Mzef-22*.:lnlQ) otsX' MC4im 

FF4025 MC4100 ire-l Kzcf229::TnlO) QiotsBlaiZr? *>tsX* MC4lQD 

FF4026 MC4100 ire I Mzcf22+.:TnlO) 4><otsA-laeZ)7 «mJT MC4MB 

FF4031 MC4100 MotsAhiTnIO <tHotsBlacZ)8)l otsX' MV4lan 

FF4035 r MC4100 MotsA I <toots8lacZ) otsX* Mir4)eo 

FF4037 MC4100 ^/jA/::Tn/0 4Xr>wa-/<irZV?|/ /rM::Tn/tf otsX m Ml - M 

FF4049 MC4100 .&|0f*4/::Tn/0 *ionB-lavZ\8\l MtreAr.TnJO) otsX* ut 4im 

FF4050 MC4100 MotsAi::TnW 

FF4055 f MC4100 A(wA) QiotsB-lacZ) cvj-95::Tn/0 «/J*w t4J( , 

FF4056 rjrf MC4100 4*otsB~lacZ) Mcvs95::TntO\ <>tsX wl4K <! 

FF4057 r ^ MC4100 UotsA ) QiotsB tacZ) S(cvs95::Tn/0) treAiiTniO atsX wt4m *? 

FF4169 MC41OO««A/::Tn/0 

FF4171 MC4100/rM::Tn/0 

GDI MC4100 zih-730::TnlO jttpR 

IK1 MC4100 QiotsB-tacDB fucr.TnIO o/iJT Mt 4I00 

IK2 W1485 Mac otsX ww 

IK3 W1485 Mac <t*otsA*1ae Ztf oisX WX4M5 

IK4 W1485 Mac QiotsB-taeZX otiX wt49S 

IKS LCB107 Mac &otsA-1ocZ)7 otsX^^ supE44 

IK6 MC4100 <Ko«*-/<icZk? sH-300::TnlO owJT MC4100 

IK10 W1485 A/ac *<otsB-lacZ)8 sri-300::TnlO c\sI53( Am) *«* wi4ftl 

IK12 MC4100 <tXo/j*9-/«irZK rrj.«::Tn/0 „ C4IB0 

IK19 W1485 A/<ir tootsB-taeZ* sH-300::TtilO otsX' Mt - 4I00 

IK21 W1485 A/flr <tKo«a /flfZ>» mu/52/5::Tn/0 otsX wl4ns 

IK25 W1485 A/flf 4KWi9-/afZtf ro-95::Tn/0 otsX' Mi:4l0O 

IK26 W1485 A/ac QiotsB-taeZ* c\s-9S::TnlO otsX wl4n < 

IK35 W1485 Mac <tHo/iir-/flrZkT om^h,, 

IK41 W1485 Mac <tHouB-tocZ)8 cysI53(Am) otsX WX4mA supE42 

IK43 MC4100 <t*otsA-lacZ\7 ers-9S::TtiiO otsX wl4MS 

IK44 MC4100 <PiotsAJocZ)7 otsX WX€0 

IK45 MC4100 otsAI::TnlO oaJT^wo 

IK46 MC4100 c/M/::To/0 orx* wl4M 

1K49 F'2 <tKAf/r-/acZy/MC4100 ouA/::Tn/0 0*1***0100 

IK50 F2 <Ubet T-lacZWMC4W otsA I ::Tn 10 otsX wl4a$ 

IK31 F'2 Q(betB-lacZ)2fMC4\Ob otsAI:;Tr\IO ou** MC 4ioo 

IK32 F'2 <MbetB*tacZI2MC4\<lQ otsA I : :Tn 10 otsX WX4ms 

IK38 LCB107 Mac 4<otsA-lacZ)7 mtiF159< Am) wJr LCBI07 ^«/p£44 

IK39 LCB 107 Mac Q(otsA-lacZ)? meiFI59{ Am) o/j* LCB107 V ^/-3(?7: :Tn/0 

IK60 LCB 107 Mac &otsA-!acZ)7 m*tF159(Am) cvsC mutS2tS::TnlO otsX' supE44 

IK61 LCB107 Mac <t*otsA-lacZ)7 metFI59{Am) onX* MC4X00 supE44 

IK63 LCB107 Mac <t>(ot$A-1acZ)7 metFI59(Am) otsX* MC4XOO sup 9 zbf507::TnlO 

IK65* MC4100 WoisB-tacZX rpoS359::TniO 

IK68 F" cysC argA lysA pro mutS2!5;:Tn/0 oaJT 

IKW W1485 Mac &otsB-tacZ)8 rpoS359 :TniO 

IK70* W1485 Mac <tHotsB~lacZ)8 cysI53( Am)? rpoS359: :Tn/0 ji/p£^2 

JC10240 Hfr cA56 fnr-iflO *W-JOfc:Tn/0 Mi-/ r^M/ jpof/ rpsE2300 X" o/jJT 

JW1071 Hfr sup 0 zbf-507::TnW trp49 lacZ\25 retAI spoTl X * 

K-12 F* wild type 

KM 78 F" cysC argA lysA pro 

LCB107 F" tre-l trpA43 metBI lacYI malAI (XT rpsU34 o/jjr LCB ,07 supE44? 

MC4100 F* araDI39 MargF-tac)UI69ftbB530l relAI rpsL150 deoCl ptsF25 rbsR otsX* UC4ltti 

MLE413 F'2 <t*betB-tacZ)2/MC4\m recAS6 Rif 



CGSC JSC 
B. LowUv 

38 

D. P. dart i47) 
D. P. Gait »47) 
D. P. Clart i47> 
CGSC 70J» 
MC4100 ' IC10240 
19 
19 
19 

PUFF 1601) < FF4169 

19 

19 

PKFF2033* FF4019 
19 

Tc* Km % c/FF4012 
PKUE5) » FF4031 
Tc^ of FF4QJ7 
PKFF1005) * FF4049 
PKIK26) * FF4035 
Tc* of FF4055 
PKFF4171I » FF4056 
19 
52 

B. Lowsky 
PKDC906I « FF1112 
Tc^ofOSlI 
PKFF2032) * IK2 
PKFFU12) » IK2 
Tc k ofOS36 
PKIK10) * FF1112 
PKFF1005) m PR 
PKN3002) * FF1112 
PKIK6) x 1X4 
PKES1481) * IK4 
PKIK12) x 0C4 
PK1K12) x UC4 
PKLCB107) x IK25 
PUDC943) « PF2 
PKIK26) x FF2032 
PKFF1U2) x UC43 
PKFF4169) x FF2032 
PKFF4169) x IK44 
MLE914 x OC45 
MLE914 x fX46 
MLE413 x 1X45 
MLE413 x UC46 
PKBL50) x IK5 
PKJW1071) x IK58 
PKIK68) x 1K» 
PKFF1112) x IK60 
Pl(JWi07U x 1X61 
PKRH90) x FF1U2 
PKES1481) x KM78 
PKRH90) x (X4 
PKRH90) x 1X41 
CGSC 6074 
CGSC 6391 

W. Epstein 
CGSC 6407 (6) 
CGSC 6152 
M. W. EshooilT) 



Continued on follow* page 
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TABLE 1— Continued 



Strain, 
bacteriophage, 
or pfasmid 



Description'' 



Const! 



MLE914 F*2 <tH^fr /arZW/MC4100 revA36 Rif 

N3002 F" cvj-95::Tn/0 WirrnD-rrnEW A oisX 

0518 W1485 SiargFlac)U169 zah-735::Tt\IO otsX wl4M < 

0519 W1485 MargF-lac)UI69 zah~73S::TnlO c\si53i Ami tatZI2<t Ami <iuA' wl4 «, 
OS36 LCB107 MorgFlat)UI69 zah-735::TnlO &otsA-laiZ)7 otsX Lt:9lol supE44 
PF1 W1485 A/ac rys/5J<Aml /ar2/^5f Ami «m*w,«< 

PF2 W1485 Mac <XotsB*iacZ)8 cysl53lAm\ «m*wi«5 

PFll BUO zih-730::TnlO 

RH9C MC4100 rpo5.WV::Tn/0 

SH205 HfrC phoAB glpD3 glpR2 relAI umA22 Ui MargF-tacWKfi zah-73S::TniO 

UE3 F* KL16 rAi MpisHl-rrr) galR treA::latO 

W1485 F* wild lypc 0W* wl4ll , 



M. W. £saoo(l7) 
CGSC ttfe 
PKSH2D5 * W14K5 
PlfSHV* * DC931 
52 

TV of 0519 
PKFFUIS x PFi 
PKGD1) * 8L50 
R. Henoe-Aronis (30) 
E. Breocr <4*> 
8 

CGSC 5C4 



Phages 
PI 

A NK561 



cml clr-100 

W21 ri::Tn/0 Oam29 />am80 



38 

E. Brcn 



r(49) 



Plasmids 
cos4 

pACYClM 

pFFl 

pFFlOl 

pFF102 

pFF106 

pFF109 

pFF114 



Ap' 

Tc f Cm f 

Ap'; vector co$4 

Ap'. vector cos4 

Ap'; vector cos4 

Cm r ; vector pACYC184 

Ap f ; vector cos4 

Cm f ; vector pACYClW 



46 

12 

This stutf? 
This study 
This study 
This stud? 
This stud? 
This stud? 



- otsX is allelic won katF . apptf. rii-2. and rpoS. otsX carries an amber mutation in many K-12 strains (see teat). The otsX genotype is Am where it is 
relevant and when known, the origin of the oisX allele is indicated by a subscript. All strams described as carry a (kletioo generated t* selecting aTc 
derivative from a strain carrying MamF-hc)Ut69 z>h-735::Tn/0. The XantF -tut tUI69 dektion ts 100% cotransducible with the transposon 14* TW symbol ♦ 
indicates that the strain carries a lacZ opcron fusion generated by a x piac Mu53 or X p/«c Mu55 insertion. 

* CGSC £ eoti Genetic Stock Center. Yak University. New Haven. Conn. All CGSC strains were obtained from B. J. Bachmann. 

«- -These strains carry a X ptorMu35 insertion in which the kanamycin resistance marker of the prophage is deleted together with otsA::JniO 

* We do not know whether the &tcys::TntO) deletion encompasses the otsX wuas allele. 

* The katF aflck rpoS3S9:UtO is identical with csh2::TnW. This mutation was generated m MC4100 (30). 



assayed on agar plates (30). Trehalose-6-phosphate was a gift 
from Marine DNA, Troms0, Norway. 

RESULTS 

Identification of oisX. Most suppressor-free K-12 strains 
(e.g., sup 0 derivatives of W1485) display a lower expression 
of otsA-lacZ and otsBAacZ operon fusions than their coun- 
terparts with an amber suppressor (i.e., supD* supE. or 
supF). However, the much-used 5wp°-containihg strain 
MC4100 is an exception; its derivatives display the same 
high expression of ots-tacZ fusions as strains containing 
sup * (52). To make certain that our strain of MC4100 did not 
display a high expression of the otsA and otsB genes because 
of an unknown amber suppressor, we verified that a met- 
F{Am) mutation was not suppressed in this genetic back- 
ground (see Materials and Methods). 

In order to insert Tn/0 near the gene which activated otsA 
and otsB in MC4100, we prepared a culture of MC4100 with 
random Tn/0 insertions. A PI lysate grown on this culture 
was then used to transduce PF2 [W1485 sup 0 otsBJacZ 
cyj/(Am)J, which displayed a low expression of its lacZ 
fusion, to Tc r . The Tc r transductants were scored for growth 
on plates with medium 63-lactose. On this agar medium, 
strains with high or low expression of lacZ fusions in otsA or 
otsB are known to display a Lac* or Lac" phenotype. 
respectively (52). Out of 800 Tc r transductants tested, 6 were 
Lac*. 



Our rationale for using PF2 [cyj/(Am)J as the recipient in 
this cross was to detect whether a Lac * phenotype of the 
transductants was caused by a spontaneous amber suppres- 
sor mutation. But fortuitously, all six Tc r Lac" transduc- 
tants selected appeared to be Cys\ When PI hsates pre- 
pared from these Tc r Cys* transductants were used to 
transduce DC931 (cyr/(Am) lacHAm)] to Tc f . *e obtained 
Tc r Cys* transductants at a high frequency, bu gone was 
Tc r Lac*. Furthermore, when the same lysates »tre used to 
transduce AT2427, which carries a cysJ point DBQtion. we 
also obtained Tc r Cys* transductants. Thus, these crosses 
established that the gene in MC4100 that had the ability to 
provoke a high expression of the tacZ fusions in a sup 0 
background maps near cysIJ at 59 min. We tentatively 
named this gene otsX. and for clarity we name the MC4100 
allele o7jJT MC 4ioo **** *** W* 4 ** allele otsX wimy In other 
words, alleles that caused elevated transcription of the otsA 
and otsB genes in a sup 0 background are designated otsX*. 

otsX genotype and strain coostroctjoo. In order to know the 
origin of the otsX allele of the constructed strains, we first 
checked the otsX genotype of the basic strains. This was 
done by transducing their otsX allele together wkh a neigh- 
bor selectable marker into two recipient strains, oce carrying 
the otsX* MC 4ioo allele and thc othcr * e ot&mm allele. 
Both recipients were sup 0 Mac and carried either otsA-lacZ 
or otsBJacZ. When the donor and recipient had the same 
otsX genotype, all transductants had the same Lac pheno- 
type as the recipient. When the donor and the rtcpent had 
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tri mutS otsX cyt fuc 

. « ♦ f « — , * 

56 59 60 mm 

(A) , o*\ 

(B) ^ , 

(C) 

(D) ° 13 , 

FIG. 1. Mapping of otsX by coiramductional analysis. The val- 
ues given on ihe anows show the PI cotransduction frequencies 
between otsX and various markers in the 59-min region. The arrows 
point at the selected marker, which in al cases was a Tn/0 insertion. 
All recipients were Mac and carried an otsB-lacZ operon fusion, and 
the otsX genotype of the Tc r transductasts was scored by determin- 
ing their Lac phenotype on agar plates with medium 63- lactose. 
From each cross, at least 100 Tc r transductants were screened. 
Cross A, donor IK12 <o*::Tn/0 otsX~ MC4loa ) and recipient IK4 
(otsX wl4ts ); cross B. donor IK1 {fuc::l%iO otsX* MC4loa ) and recip- 
ient PF2 (otsX wt4ti ); cross C. donor IK21 imutS::lnlO o§sX w ^) 
and recipient FF1112 CofiJr*c4iao>: ooss D. donor 1K6 Ur/::Tn/0 
owA^mcmoo) and recipient IK4 (otsX Wim3 ). 



differing types of otsX alleles, the transductants would be of 
both Lac phenotypes. By setting up crosses in which the 
otsX genotype of the donor and tbe recipient differed, we 
could always pick a progeny with the desired otsX alkie. 
This strategy was therefore used m the strain constructions 
listed in Table 1. Beside MC4100. we found that ES1481 
carries an otsX* allele, whereas JC10240. N3002. and DC906 
are otsX. 

otsX* -containing strains that also were otsA * otsB~ could 
be distinguished from isogenic otsX mutants by their osmot- 
icaily somewhat more tolerant phenotype, but this pheno- 
type was not strong enough to be used for positive selection 
in a transduodenal cross. We also avoided a direct selection 
for a Lac* phenotype of otsAAacZ or otsBAacZ mutants, 
because strains carrying a fusion together with an otsX allele 
gave rise to spontaneous Lac" mutants when streaked on 
agar medium with lactose as an energy source. Ten such 
mutations, examined by transduction*] analysis, appeared to 
be linked to the fusion: thus, they seemed to be promoter 
mutations (data not shown). 

Transduction!] mapping of otsX. We constructed strains 
which carried fitcr.TnW. cysC::TnI0. mutS::TniO. or 
sri::TnI0 together with either the otsX* MC4W or the 
otsX WX4%s allele. These strains were used as donors in 
transductional crosses in which the recipient carried an otsX 
allele of the opposite category and an otsB-lacZ fusion. Tbe 
transductants were selected for the Tn/0 marker (i.e.. TcO 
and the cotransduction frequency of otsX and Tn/0 was 
determined by scoring the Lac phenotype. The four crosses 
outlined in Fig. 1 placed otsX almost exactly at 59 min. 

In a three-factor cross, a PI lysate prepared from strain 
IK68 (otsX* mutS::Tt\lO cysQ was used to transduce strain 
IK4 (otsX wlw otsB-lacZ) to Tc/. Of 175 transductants 
tested. 54 had received only mutS::TnlO> 89 had received 
mutSr.TnW and otsX*, and 32 had received mutS::TnW. 
otsX*. and cysC. The lack of transductants which had 
received nwtSnTnlO and cysC. but not otsX*. showed that 
the gene order was mutS-otsX-cysC. This is in accordance 
with data presented in Fig. 1. 

In addition to otsX. katFdl). appR (53). and csi-2 IrpoS) 
(30) have been mapped to 59 min (41. It has been shown that 



TABLE 2. Influence of the otsX ikatFi allele and genetic 
background on 0-plactosidase expression from 
otsA-iacZ and otsB-tavZ operon fusions 



0-GaUciosi<iase 
actr*ir> iL'mg of 
proceui* when 

Strain Description" J^** 1 ,n: 



LOM + 
LOM 0.3 M 
NaCI 



IK3 


W1485 sup 9 OtsA-iacZ otsX wlm$ 


3> 


62 


IK4 


W\m sup 9 otsBJacZ ot*X wt4MS 


10 


44 


IK41 


W1485 supE onB-iocZ otsX Wi4ts 


77 


780 


IK19 


W1485 sup 9 otsB-lacZ oiiX'^^a, 


120 


850 


IK35 


W1485 sup 0 otsBJacZ otsX x CBW 


9 


39 


FF1U2 


WC4100 sup 9 oisB4acZ alsX'^xo* 


130 


540 


IK44 


MC4100 sup 9 otsAJocZ otsX wtms 


18 


57 


IK58 


LCB107 supE onA-tacZ otsX Lcmw 
IXB107 sup 0 otsAJaeZ otsX^w 


360 


1J00 


IK59 


33 


89 


IK63 


LCB107 sup 9 otsA-bcZ oisX^^^ 


450 


1.800 


IK65 


MC4100 sup 9 otsB-bcZ katF iTMO 


5 


18 


IK69 


W1485 sup 9 otsB heZ katF. JnIO 


7 


21 


IK70 


W1485 supE otsB-lecZ katF.iTnlO 


8 


22 


MC4100 




0 


0 



" All strains are Hoc. 

* One urtrt of cruyme activity » 1 omol of o-mtrophenoJ fonrvd per min at 
28*C. Each value is an average of n least three independent measurements. 
Standard devoaoos were within = 15^. 



katF is allelic with appR (54), as well as with csh2 irpoS) 
(30). and our results show that otsX is another allele of this 
locus (see below). 

Influence of the oisX aBeJe and the genetic background 00 
oisA and oaB expression. To measure quantitatively the 
influence of the otsX alleles on the expression of otsA and 
otsB. we assayed the f}-galactosidase activity of lacZ fusion- 
containing strains with known genetic backgrounds. The 
activity was assayed after growth in a medium with or 
without 0.3 M NaCI added. otsA-tacZ or otsB-lacZ fusions in 
the same genetic background displayed similar values for 
3-galactosidase (19. 52). Therefore, in most cases only one 
type of fusion is included for each combination of genetic 
background and otsX allele (Table 2). 

In accordance with data presented before (52), strain IK4, 
which is a suppressor-free strain with a YV1485 background 
and the native otsX wl4U allele, displayed low 3-galactosi- 
dase activity. Furthermore, the activity was much higher in 
W1485 derivatives that carried cither a supE mutation or the 
otsX* MC4loa allele, i.e.. IK41 and IK19, respectively (Table 
2). 

The difference between W1485 and MC4100 derivatives 
with respect to their expression of otsA-lacZ and <xsB-IacZ 
fusions was solely due to the otsX alleles, since the genetic 
background, MC4100or W1485, was of no importance when 
the otsX allele remained the same. Thus. IK44 (MC4100 
otsX wl4ts ) displayed a p-galactosidase activity similar to 
that of IK3 (W1485 otsX^^h and FF1112 (MC4100 
otsX* MC4l00 ) displayed an activity simitar to that of IK19 
(W1485 orxJr^ MC4l00 ) (Table 2). 

We have reported previously that LCB107 derivatives 
display a higher expression of otsA and otsB than MC4100 
derivatives, and we have concluded that this property is not 
due to the otsA and otsB genes themselves (52). The otsX 
allele of LCB107 was not tbe cause of the elevated expres- 
sion of the otsA and otsB genes, since the isogenic oiiB-tacZ 
fusion-containing strains IK4 (W1485 otsX wl4ts ) and IK35 
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(W14S5 otsX LCBl01 ) displayed similar low 0-galactosidase 
levels. 

From the construction of LCB107. it was uncertain 
whether it carried supE44 (5). Our study showed that 
LCB107 carries a supE mutation (see Materials and Meth- 
ods). The j//p°-containing strain IK59. demed from 
LCB107. displayed strongly reduced expression of the otsA- 
IacZ fusion compared with that of the parentaJ supE-con- 
taining strain IK58. However. supE was not responsible 
for the superexpression of the otsA and otsB genes in 
LCB107 (i.e.. expression above the MC4100 level), since 
IK63 (LCB107 sup 0 otsX* MC4XOO ) expressed the oxsAAacZ 
fusion at the same very high level as IK58 (LCB107 supE 
otsX LCBW l Apparently. LCB107 carries a mutation in an as 
yet unidentified gene which influences the expression of the 
otsA and otsB genes. 

otsX b aOelk with katf and the amber-mutated gene of £. 
coii K-12. We found that all sup°<on taining strains listed in 
Table 2 carrying the otsX WX4ts or the otsX UCBl0J allele, and 
also £. coii K-12 itself, displayed the glycogen-negative and 
catalasc-negative phenotype that is found in katf mutants 
(30). while all strains containing 0/5** MC4 ,oo or sy ? £ did not 
show these defects. Also, when a katFv.TnlO mutation was 
introduced into FF1112 (MC4100 sup 0 otsX^^^h the 
resulting strain. IK65. did not display the high expression of 
the IacZ fusion seen in the parental strain, la fact, the 
3-gaJactosidase activities found were even lower than that of 
IK4 (W1485 sup 0 otsX WX4M5 ). indicating that the katF::Tn!0 
mutation reduced otsA and otsB gene transcription even 
more than otsX wl4Mi (Table 2). The simplest explanation of 
the present data is that otsX. katf, and the amber mutation 
described previously (47. 52: see above) are allebc and that 
this gene has reverted to an active form in MC410O. 

An alternative explanation is that the amber mutation was 
situated in another locus and was suppressed by the otsX* 
(katF~) allele of MC4100. If so. the ouJT MC410D allele and 
tRNA amber suppressors would represent two different 
routes for intergenic suppression of the amber mutation. 
However, the latter explanation was ruled out by the finding 
that IK70 (W1485 supE katf ::TnJ0) expressed the otsAAacZ 
fusion at the same low levd as IK69 (W1485 sup 0 
katF::TnlO) (Table 2). Apparently, in these strains the 
katF::Tt\J0 allele had replaced the amber-mutated gene so 
that suppression via supE was no longer possible. 

It is well known that many K-12 strains lose their viability 
rather rapidly when stored as colonies on LB agar in the 
cold. We found that this phenotype was linked to the otsX 
(katF) genotype. This loss of viability was not caused by 
decreased trehalose synthesis, since otsX* {kaif~) strains 
carrying an otsA or otsB mutation remained viable for a 
prolonged period (data not shown). 

otsX {katF) does not activate proV and bet. To investigate 
the influence of otsX on the transcription of the osmotically 
regulated betB, betT (17), and proU (15) genes, we used 
suppressor-free strains which were blocked in trehalose 
synthesis by an otsA::TnI0 mutation (19). Thereby we could 
eliminate any indirect effect of otsX mediated via trehalose 
synthesis. As shown in Table 3. the p-galactosidasc activi- 
ties of cells with a betB-lacZ or bctTAacZ operon fusion 
were essentially the same whether the cells carried the 
otsX* UC41O0 or the otsX wl4MS allele. The same lack of effect 
of otsX was obtained for strains canying a prolIAacZ fusion 
(data not presented). Thus, otsX is not a general activator of 
osmotically regulated genes. 

Cloning of otsA and otsB. Strains which are defective in 
trehalose synthesis display an osmotically sensime pheno- 



TABLE 3. Influence of otsX (katf) allele 00 the osmotic 
induction of 0-galacrosidase activity of wTJacZ 
and hftB-lucZ operon fusion mutant* of £. toit 



Strain 



Dctcnpttov 



&-G*acto»iduc activity 
<L «f of protctnt* 
at u\\% frown in: 



LOM 



LOM * O.J M 
NaO 



IK49 


MOIO o be\14ucZ 


150 


600 


IK50 


*>tsX WX4M < bttT-UcZ 


160 


670 


IK51 


of*** mo 100 bet&JaeZ 


350 


670 


IK52 


otsX wl4as brtB-tccZ 


320 


750 



' All straMO are Hoc. 

* One unit of enzyme activity a 1 nmol of p-attropoeaoJ formed per min at 
28T- Eacn value is an avcrate of at least three independent measurements. 
Standard deviations were withia = ISft. 



type (19). By introducing a cosmid library into FF4169 iotsA) 
and selecting for strains growing on agar with medium 
63 -glucose with 0.45 M NaCl added, we obtained plasmid 
pFFl. This plasmid carried a chromosomal insert of 43 kb 
(Fig. 2). From pFFl we constructed plasrad pFF106 carry- 
ing a 2.9-kb /Y/ndlll chromosomal fragmca. This was the 
smallest fragment obtained that restored trehalose synthesis 
in both otsA and otsB mutants, as well as in Mots A otsB) 
mutants. We also constructed the subclones pFF109 and 
pFFl 14. which carried overlapping chromosomal fragments 
with lengths of 1.85 and 11 kb. respectively, covering the 
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FIG. 2. Restriction site analysis of ibe otsBA region. (A) Restric- 
tion map of the 41- to 42-min region of the £. «fi chromosome as 
published by Kohara et al. (28); the coordinates are in kDobases 
from thr. The localization and the direction of transcription of the 
otsBA operoQ are indicated. (B) Restriction map of the chromo- 
somal part of plasmid pFFl and its subclones pFFlOl and pFF102. 
aligned with the map of Kohara et al. (O Enlarged restriction map 
of the 2.9-kb /V/ndlll fragment containing the otsBA operon. The 
extensions of the chromosomal part of plasmid pFF106, pFF109. 
aod pFF114 are shown, and their genotypes are indicated. Abbre- 
viations used for restriction sites are as follows: B. BcmH\; Bl. Bg(l: 
Bg. Bgdl: C CM: E. Ecokl: EV. £coRV; H. HmdllU He. Hindi: 
K. Kpnl: Ps. Pst\: Pv, PvuXh and S. Soli. 
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TABLE 4. Ovnotic regulation of the trehalose-6-phosphate 
synthase and the trehalose-*- phosphate phosphatase activities in 
£. coii strajni with and without ots mutations and plasmtds 



Synthase activity Phosphatase acimtv 
(U/m| of protein r* in lU/mf of protein t* in 
cells exposed to: cetis exposed to: 





M63 


M6J ♦ 0 4 M 
NaCI 


M63 


M63 ♦ 0.4 M 
NaO 


FF4171 


2 


17 




K8 


FF4026 (otsAlacZ) 


0 


0 




100 


FF4025 iotsB-hcZ) 


0 


0 


28 


28 


FF4050 \MotsA <xsB)\ 


0 


0 


22 


22 


FF4050(pFF106i 


26 


170 


66 


200 


FF4030(pFF114l 


a 


15-90- 


14 


14 


FF4050fpFF109l 


0 


0 


71 


530 


FF4057(pFF106» *otsX) 


NO 


27 


ND 


50 



* All strains wen derived from MC4100. otsX is allelic with kaif. 

* Enzyme activcas were measured in toluene -treated cells. One unit of 
enzyme activity is 1 amoJ of trehalose formed per min at 3TC. Each value is 
an avert** of at least four independent measurements. Standard deviations 
were within s 10* far the synthase activities, except for FF4O30*pFFl 14) (sec 
text), and within for the phosphatase activities. ND. not determined. 

* The synthase activity displayed la/ye variations (sec text). 



insert of pFF106 (Fig. 2). Properties of these three plasmids 
are described below. 

Expression of synthase and phosphatase activities. We 
previously described an assay for trehalose-6-phosphate 
synthase in E. coii (19, 52). In this work we have developed 
an assay for the phosphatase of the trehalose pathway, using 
synthetic trehaiose-6-phosphate as a substrate and determin- 
ing, by using gas chromatography, the amount of trehalose 
formed. In order to measure the osmotic induction of the 
enzymes, we grew the cells in medium 63 and then increased 
the osmolarity of the medium by addition of 0.4 M NaCI. 
Strains with intact trehalose synthesis which grew in the 
presence of 0.4 M NaCI were incubated for one generation, 
whereas strains with a defective trehalose synthesis were 
incubated in the high-osmoiarity medium for 2 h. All strains 
used were derivatives of MC4100 and carried a treA muta- 
tion to prevent in vitro degradation of trehalose (19). Except 
for FF4057. they were all otsX* (katf*). 

Strain FF4171 is wild type with respect to trehalose 
synthesis. As shown previously (19, 52h FF4171 has an 
osmoticaUy inducible synthase, whereas related strains car- 
rying an otsA. otsB. or MotsA otsB) mutation lack this 
synthase activity completely (19) (Table 4). In addition. 
FF4171 displayed an osmotically induced phosphatase activ- 
ity. The phosphatase activity was also osmotically induced 
in FF4026 (otsAh but not in FF4025 (otsB) and FF4050 
IMorsA otsB)], in which it was the same whether the cells 
were osmotically stressed or not (Table 4). Apparently, the 
otsB mutation blocked the synthesis of the osmotically 
inducible phosphatase, but it did not influence the back- 
ground phosphatase activity also found in nonstrcssed cells. 
It is well known that it is often difficult to measure accurately 
separate phosphatase activities in wild-type E. coii. This is 
because of the multiplicity of phosphatases in £. coii and the 
lack of specificity of some of them (42). 

Strain FF4050 [MotsA otsB)] carrying plasmid pFF106 
displayed strongly elevated activities of both the synthase 
and the phosphatase. Both activities, after subtraction of the 
background phosphatase activity, increased about fivefold 
when FF405(XpFF106) was exposed to osmotic stress. The 
synthase and phosphatase activities of strains carrying plas- 



mid pFF106 iotsA * otsB*) were much lower in the otsX 
(katF) mutant strain FF4057 than in its otsX* counterpart 
FF4050(Table4). 

Subclone pFF114. which restored trehalose synthesis in 
synthase-defective otsA mutants, conferred only synthase 
activity to the MotsA otsB) mutant FF4050. Subclone 
pFF109 conferred only phosphatase activity to FF4050 (Ta- 
ble 4). As would be expected. pFF109 did not restore 
trehalose synthesis in either otsA or otsB insertion mutants, 
since these mutants lacked the synthase activity. The ele- 
vated enzymic activities displayed by the present plasmid- 
carrying strains and the finding that the enzymes can be 
expressed separately indicated that the plasmids carried 
structural rather than regulatory genes. Presumably. otsA 
and otsB were the structural genes of the synthase and the 
phosphatase, respectively, and the lack of synthase activity 
in otsB mutants was because of a polar effect. 

Difference in copy number may explain why pFF109 
(pBR322 derived) appeared to confer a much higher phos- 
phatase activity to osmotically stressed cells than did 
pFF106 (pACYC derived) (Table 4). The lower ratio of 
osmotic induction of the phosphatase seen with pFF106 
compared with pFF109 may be because of a constitutive 
contribution to otsBA transcription from a vector promoter 
in pFF106. Stuber and Bujard (SI) have shown that 
pACYClW contains a strong promoter situated just up- 
stream of the otsBA insert in pFFlOo. 

Addition of spectinomycin (300 jig ml" 1 ) reduced strongly 
the osmotic induction of the enzyme activities in all plasmid- 
carrying strains (data not shown). Thus, the observed os- 
motic effect depended on de novo protein synthesis. It 
should, however, be noted that the synthase activity of 
stressed cells of FF4050(pFF114) was very variable (Table 
4). This was probably due to a toxic effect of intracellular 
trehalosc-6-phosphate. which accumulated in the absence of 
the specific phosphatase activity. This notion is supported 
by the finding that when FF40504pFF114) was subjected to 
an osmotic upshock. the survival of the cells, as determined 
by counting CFU on LB plates, was only 50% after 2 h and 
10% after 4 h. It has been reported previously that intracel- 
lular trehalose-6-phosphate is inhibitory to both 5. cerevisiae 
(41) and E. coii (27). 

Physical mapping of otsBA. The 43-kb chromosomal frag- 
ment of pFFl was mapped with BamHU HindllL and Sail. 
The subclone pFFlOl was in addition mapped with Pstl and 
EcoRI, and the shorter subclone pFF102 also with Bgilh 
PvulU and Ctol. In Rg. 2. the restriction maps of the 
chromosomal fragments of pFFl and its subclones have 
been aligned with the 41- to 42-min region of the physical 
map of the E. coii chromosome prepared by Kohara et al. 
(28). A very good agreement was found for all enzymes used 
by both Kohara et al. and us. The only exception was the 
Hindlll site furthest to the right in pFFl, which was not 
reported by Kohara et al. This is the Hindlll site limiting the 
right-hand side of the chromosomal insert of pFF106 (Fig. 2). 

In conclusion, the 2.9-kb Hindlll fragment which comple- 
ments MotsA otsB) mutants is localized around the coordi- 
nate 1,992 kb in the physical map of the E. coii chromosome 
(28). That is 8 to 11 kb clockwise of tar (41.6 min). On the 
basis of the localization of the chromosomal fragment of 
subclones pFF109, conferring only OtsB (phosphatase) ac- 
tivity, and pFFl 14, conferring only OtsA (synthase) activity, 
it was evident that otsA is located proximal to tar with 
respect to otsB. This localization and gene order is in 
agreement with our previous mapping data (19). However, 
on the latest chromosomal linkage map of E. coii these ots 
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genes are wrongly placed between fliA and uvrC (4) and not 
in the con-ect position between uvrC and flhD (ftbB) (19). 

It has been shown previously that a Tn/0 insenion in otsA 
does not influence the expression of a IacZ fusion in otsB 
(19). However, the present biochemical data showed that an 
insertion in otsB prevented the expression of otsA (Table 4). 
This polar effect suggests that otsBA constitutes an operon. 
The operon model is also supported by the finding that when 
preparing deletion mutants from FF4012 (MC4100 otsA:: 
Tn/0 orsB-lacZ o«Jf* M c*:oJ by selecting for Tc* and scor- 
ing for the Km Lac phenocype. we obtained a few strains, 
e.g., strain FF4035. which were Tc* Km* but remained 
Lac*. The otsBAacZ fusion in FF4035 remained under the 
control of the native ouA'ffcir/Vdependent promoter since it 
became Lac" when an otsX wl4M5 allele was introduced to 
generate strain FF4055. Apparently, the deletion of otsA:: 
Tn/0 from FF4012 had been accompanied by a deletion of 
the kanamycin resistance marker, but not the IacZ gene of 
the X piacMu insertion. The kanamycin marker is located 
distal to the promoter controlling the expression of IacZ in 
the intact A p/acMu prophage. Thus, these data showed that 
the otsB gene is transcribed in the direction towards otsA. 
i.e.. in the counterclockwise direction on the chromosome. 
We assume that the transcription of otsA in pFF114 (Table 4) 
starts from a promoter which is not functional on the 
chromosome. 

DISCUSSION 

The otsA and otsB genes have been mapped previously 
(19). In this study we have cloned these genes and deter- 
mined their localization oo the physical map of the £. coli 
chromosome prepared by Kohara et al. (28). We present 
evidence that otsB and otsA constitute an operon in which 
otsB, encoding trehalose-6-phosphate phosphatase, is prox- 
imal to the promoter, and otsA. encoding trehalose -6-phos- 
phate synthase, is distal. Thus, a recently described point 
mutation named otsP, which maps near otsA and seems to 
influence only the expression of the phosphatase, is probably 
allelic with otsB (27). 

For 5. cerevisiae, the synthase and the phosphatase of the 
trehalose pathway have been copurified as a complex and 
the individual enzymic activities have not yet been resolved 
(33). This enzyme complex may not exist in £. coii, since the 
OtsA and OtsB activities could be expressed separately in an 
otsBA deletion mutant. Also supporting this is the finding 
that we were unable to regain the synthase activity from cells 
disrupted in a French pressure cell, while the phosphatase 
activity remained stable (data not shown). 

It has been shown previously that the variation among 
K-12 strains with respect to their capacity to synthesize 
trehalose and to express otsBA is because of an amber- 
mutated gene in E. coli K-12 and many of its derivatives (47. 
52). In this study we have identified this mutated gene as 
katF , which has recently been reported to be allelic with 
appR (54) and csi-2 (rpoS) (30). The DNA sequence indicates 
that it encodes a sigma factor (39). Our demonstration that 
katF carries an amber mutation in E. coli K-12 and W1485 
(wild type), but not in MC4100, is in agreement with earlier 
reports that appR is inactive in many K-12 strains (16, 53) 
and that MC4100 is among the strains in which appR is 
active (16). 

Genes which display increased expression when the cell is 
exposed to a particular external stimulus or stress are often 
grouped into stimulons. A stimulon can comprise several 
regulons. and many genes belong to more than one stimulon. 



For instance, carbon starvation (37) and osmotic stress (10) 
are reported to increase the synthesis of at least 54 and 41 
proteins, respectively, and 5 osmotic-stress proteins are 
reported to be among the starvation proteins (25). In exper- 
iments with strain MC4100. it has been shown that katF 
positively controls 15 to 20 genes expressed during carbon- 
starvation-induced entry into stationary phase (30). Ho\* 
ever, the characterization of stimulons in £. coii has bet 
carried out with different strains. Thus, the fact that £. an. 
K-12 and many of its derivatives carry an amber mutation in 
katF may have made investigators overlook ta/F-dependem 
induction. This has to be taken into account in further 
evaluations of £. coii stimulons. 

The katF gene is known to be transcriptionally regulated, 
and it has been suggested that slow growth is a signal for its 
transcription (30. 40). Since osmotic stress slows the growth 
of £. coii. it probably causes katF induction, and this may 
partly explain the observed osmotic regulation of otsBA 
transcription. However, it is unlikely that this is the only 
mechanism, since the otsBA transcription, albeit being low. 
was osmotically regulated also in strains carrying katF::Tnl0 
or the amber-mutated katF gene. 

proU transcription is dependent on the housekeeping 
sigma factor a 70 (26). and the osmotic regulation of proU can 
be explained by a stimulatory effect of potassium glutamate 
on the transcriptional complex (43). independent of DNA 
supercoiling (44). Apparently, an intrinsic property of the 
osmotic stress-dependent proU promoter elicits this potas- 
sium glutamate effect without the need of protein factors, at 
least in vitro (43). A similar effect of potassium glutamate on 
the otsBA promoter could explain the osmotic induction of 
otsBA seen in strains mutated in katF. 

Trehalose protects organisms against a variety of stresses 
(see the introduction). The finding that transcription of 
otsBA. as opposed to proU and bet. is katF dependent 
nourishes a notion that trehalose, in addition to being an 
osmoprotectant, may serve as a more general stress pro- 
tectant in £. coli. 
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Molecular mechanisms of osmotic stress tolerance were 
studied in Mesembryanthemum crystallinum (ice plant), 
a facultative halophyte capable of adjusting to and 
surviving in highly saline conditions. We screened a 
subtracted cDNA library enriched for salt stress-induced 
mRNAs to identify transcripts involved in this plant's 
adaptation to salinity. One mRNA, Imtl, was found to 
be up-reguiated in leaves and, transiently, in roots. 
Nuclear run-on assays indicated that this mRNA is 
transcriptionally regulated. Imtl encoded a predicted 
polypeptide of M, 40 250 which exhibited sequence 
similarity to several hydroxymethyl transferases. Expres- 
sion of the protein in Escherichia coli and subsequent 
activity assays identified the protein as a novel myo- 
inositol 0-methyl transferase which catalyzes the first step 
in the biosynthesis of the cyclic sugar alcohol pinitoi. 
Pinitol accumulates in salt-stressed M. crystallinum and 
is abundant in a number of salt- and drought-tolerant 
plants. The presence of high levels of sugar alcohols 
correlates with osmotolerance in a diverse range of 
organisms, including bacteria, fungi and algae, as well 
as higher plants. The stress-initiated transcriptional 
induction of IMT1 expression in a facultative halophyte 
provides strong support for the importance of sugar 
alcohols in establishing tolerance to osmotic stress in 
higher plants. 

Key words: gene expression/myo-inositol 0-methyl trans- 
ferase/osmoprotection/salt stress/sugar alcohol 



Introduction 

For plants, environmental stress is an inescapable and 
persistent condition. Water stress imparted by drought or 
high salinity is the most widespread abiotic stress and 
constitutes the most stringent factor in limiting plant 
distribution and productivity (Boyer, 1982). A large body 
of physiological work has described how various species 
respond to water deficit. A number of salt tolerant plants 
(halophytes) have evolved strategies that enable them to 
adjust to salinity, avoid salinity-induced water stress and 
succeed in environments that will-jiot accommodate salt- 
sensitive species (glycophytes). Such strategies can include 
changes in photosynthesis, alterations in membrane structure, 
exclusion of salts, accumulation and partitioning of ions, 
increases in intracellular organic solutes and precocious entry 
into reproductive phase and senescence (Jefferies, 1981). 



There have been a number of recent investigations into 
the molecular responses of plants to water stress. Most of 
these studies have identified polypeptides that are induced 
in giycophytic plants during water stress imposed by drought 
(Gomez et aL, 1988; Mundy and Chua, 1988; Mason and 
Mullet, 1990; Cohen et aL, 1991; Plant et al., 1991). Bartels 
et oL (1990, 1991) have identified several mRNAs, including 
one encoding an aldose reductase, induced during drought 
in the resurrection plant, Craterostigma plantagineum, which 
can remain dormant yet viable during long periods of 
complete desiccation. A consistent theme in giycophytic 
water-stress responses is the expression of polypeptides 
which are also associated with abscisic acid (ABA)-mediated 
seed and embryo desiccation (Skriver and Mundy, 1990). 
Examples include Rab proteins (Mundy and Chua, 1988), 
Lea proteins (Dure et aL, 1989) and the aldose reductase 
mentioned above, which is induced in barley embryos as 
well as desiccating resurrection plant leaf tissue (Barrels 
et aL , 1991). The function of many of these proteins (e.g. 
Rabs and Leas) during stress is not clear. Many share some 
biochemical characteristics such as high hydrophilicity, and 
it is thought that their role in maturing seeds and stressed 
leaf tissue is to preserve the structural integrity of ceils during 
the desiccation process (Skriver and Mundy, 1990). 

The effects of salinity on glycophytes have also been 
investigated. Claes et aL (1990) identified a salinity-induced 
rice mRNA encoding a protein of unknown function. Work 
with tobacco in tissue culture has established that salt-adapted 
cell lines exhibit changes in ion compartmentation (Binzel 
etaL, 1988), cell wall diemisrjy (Iraki et aL , 1989) and gene 
expression (Singh etaL, 1989). 

Studies with glycophytes have established the importance 
of gene expression in plant stress responses and defined 
classes of proteins that are induced by desiccation, but most 
of these investigations have not been successful in distin- 
guishing pathological from stress-ameliorating responses. We 
have taken advantage of a unique system, the inducible 
halophyte Mesembryanthemum crystallinum (common ice 
plant), in order to understand the mechanisms by which some 
' plants avoid salinity-induced water deficit. As a facultative 
halophyte, the ice plant undergoes a set of stress-inducible 
physiological and biochemical changes that allow it to adjust 
and maintain cell viability and turgor, conserve water and 
continue growth during extreme salt stress. One physio- 
logical adaptation, the stress-induced switch from C 3 to 
Crassulacean acid metabolism (CAM), has been studied in 
depth (Winter and von Willert, 1972; Bohnert etaL, 1988 
and references therein). CAM is an alternative photosynthesis 
pathway that allows night-time carbon fixation, thereby 
resulting in decreased evaporative water loss (Ting, 1985). 
The shift to CAM has been shown to require increased 
expression of mRNAs encoding a number of CAM pathway 
enzymes (Ostrem et al. , 1987; Vernon et aL , 1988; Schmitt 
etaL, 1988; Ostrem etaL, 1990), establishing Mesem- 
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bryanthemum as an attractive system for the study of salt 
stress at the molecular level. 

CAM is considered to be separate from the ice plant's 
ability to adjust to and tolerate high salinity at the cellular 
level. CAM is a complex physiological phenomenon that 
serves as a long-term water conservation strategy, and is not 
fully operative until 7-14 days following environmental 
stress in 6 week-old M.crystallinum (Winter and Gademann, 
1991; Bohnert et a/., 1988). CAM is preceded by a period 
of transient wilting (2-4 days) followed by recovery and 
continued growth (Winter and Gademann, 1991; Bohnert 
et aL , 1988; D.Vernon, unpublished). It is likely that there 
are mechanisms distinct from CAM that are responsible for 
this observed adjustment to saline conditions. To investigate 
the molecular basis of salt tolerance, we generated and 
screened a subtracted cDNA library enriched for stress- 
induced sequences. Here we show that this facultative 
halophyte responds to osmotic stress by transcriptional 
induction of a gene encoding a novel methyl transferase. This 
methyl transferase is identified by functional assay as a myo- 
inositol 0-methyl transferase involved in the biosynthesis 
of a cyclic sugar alcohol, pinitol. Pinitol is abundant in a 
number of salt- and drought-tolerant plant species, including 
salt-stressed M.crystallinum, where it can accumulate to 
>70% of soluble carbohydrate (Paul and Cockburn, 1989). 
The transcriptional induction of this cyclitol biosynthesis 
enzyme in salt-stressed M.crystallinum indicates that the 
cyclic sugar alcohol pinitol may play a crucial role during 
adaptation to osmotic stress in this facultative halophyte. 

Results 
cDNA isolation 

To identify environmentally-induced molecular changes 
involved in the ice plant's adaptation to salinity, we 
constructed and differentially screened a subtracted cDNA 
library enriched for stress-induced sequences. cDNA was 
generated from poly(A) + RNA that had been isolated from 
7 week-old soil-grown plants stressed with 500 mM NaCl 
for 10 h. By this time, plants start to recover from stress- 
induced transient wilting (Winter and Gademann, 1991), but 
have not yet accumulated large amounts of mRNAs encoding 
components of the well characterized CAM pathway (Ostrem 
et ol. 9 1987, 1990). Three cycles of differential screening 
of -I0 5 plaques with labeled first-strand cDNAs from 
stressed and unstressed plants yielded eight consistently 
up-regulated inserts. ' Cross-hybridization experiments 
indicated that the inserts represented three distinct clones 
(data not shown). A 1.6 kb insert, now referred to as Imtl , 
was chosen for further analysis. 

Expression of fmtl mRNA 

Expression of the Imtl transcript was analyzed on Northern 
blots of RNA isolated from root and leaf tissue of hydro- 
ponicaliy-grown plants (Figure 1). Total RNA was isolated 
from unstressed plants and plants harvested at several 
timepoints over the course of a 6 day stress with 400 mM 
NaCl. The Imtl cDNA probe hybridized to a salinity-induced 
mRNA of - 1.6 kb in both leaf and root RNA. The pattern 
of induction differed in these two organs. In unstressed 
leaves, Imtl was present at very low levels, The transcript 
accumulated gradually in stressed leaves (Figure 1A), being 
detectable -after 6 h of stress, but inconspicuous until the 
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Fig. 1. Hybridization of the Imsl cDNA to Northern blots of 
M.crystallinum RNA. Total RNA was isolated from leaves (A) or 
roots (B) of hydroponically-grown 6 week-old control plants 
(unstressed) and plants that had been stressed with 400 mM NaCl for 
6. 30, 78 and 126 h. Control plants were harvested at each timepoint. 
10 ?% of leaf (A) or 20 >ig of root (B) RNA from each timepoint were 
resolved on formaldehyde -agarose gels, transferred to nitrocellulose 
and probed with 32 P-labeled Imxl cDNA. Auioradiographs of blots are 
shown. The root blot was over-exposed to illustrate the disappearance 
of the transcript in roots berween 78 and 126 h of stress. Transcript 
size is indicated in kb. 

second day, after 30 h of stress. Accumulation of the 
transcript in leaves reached a maximum by the sixth day of 
salt treatment (126 h). Imtl was transiently up-regulated in 
roots (Figure IB), rising from undetectable levels to a 
maximum level of expression during the second day of stress 
(30 h). Interestingly, the mRNA completely disappeared 
from roots by the time-of maximum expression in leaves. 
Blots of dilutions of leaf and root RNA indicated that, at 
the times of maximal expression, the Imtl mRNA is >25 
times more abundant in leaves than in roots (data not shown). 
A prolonged exposure of the root RNA Northern blot is 
shown in Figure 1 to illustrate the complete disappearance 
of this transcript between the fourth and sixth days of stress. 

Genomic Southern analysis 

The Imtl transcript induced in leaves and roots is encoded 
by a single nuclear gene or small gene family. Nuclear 
genomic DNA digested with various restriction enzymes was 
resolved by 1 % agarose gel and Southern-blotted along with 
genome copy number equivalents of the cloned cDNA 
(Figure 2). Blots were probed with 32 P-labeled cDNA 
fragments and signal intensities were quantified using a 
j8-scanner (Betagen Inc., Waitham, MA). Probes specific 
to either the 5 '-coding region (not shown) or the 3 '-non- 
coding end of the cDNA hybridized to single bands of equal 
intensity in each lane (Figure 2A). High stringency wash 
conditions were identical to those used for the Northern blots 
shown in Figure 1. Comparison of band intensities with copy 
number reconstirutions (Figure 2B) indicated that the bands 
probably represent a single gene. A blot probed with a 
400 bp 3 '-end fragment is shown. 
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Fig. 2. Southern analysis of M.crystailwum genomic DNA. (A) 4 jig 
of nuclear DNA were digested with EcoRl (lane 1), K/ndM (lane 2) 
or Hindi (lane 3). resolved by agarose gel electrophoresis and 
Southern bloned alongside known amounts of Imtl cDNA 
corresponding to 0.5, U 2. 4 and 10 copies per genome (B). The blot 
shown was probed with a 32 P-labeied 400 bp 3'-end fragment of the 
Imtl cDNA. Positions of size markers are shown in kb. 
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Fig. 3. Nuclear run-on assays of Imtl transcription. Imtl cDNA was 
slot-blotted onto nitrocellulose and probed with 32 P-iabeled transcripts 
generated by nuclei isolated from unstressed and saJt-stressed (102 h) 
M.crystatlinum. A PEP carboxylase cDNA clone (Ppcl) was included 
on blots as a positive control for transcriptional induction (Cushman 
et ai. 1989). Bluescript DNA (KS~) served as a control for 
background. Hybridization to 18S rDNA (I8S) was used to normalize 
total transcription rates of nuclei from stressed and unstressed plants. 



Nuclear run-on assays 

The Imtl mRNA is transcriptionally induced in leaf tissue 
(Figure 3). Nuclear run-on experiments were performed with 
nuclei isolated from control and salt-stressed M.crystollinum. 
Transcripts were labeled with ( 32 P]UTP and hybridized to 
excess Imtl cDNA slot-blotted onto nitrocellulose. Ppcl 
DNA. which encodes the transcriptionally induced CAM 
enzyme PEP carboxylase (Cushman etai, 1989), was 
included as a positive control. Hybridization to 18S rDNA 
was used to normalize the overall transcription rates of 
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Fig. 4. Nucleotide sequence and predicted amino acid sequence of the 
Imtl cDNA. Underlined regions share homoiogy with hydroxymethyl 
transferases from plant, bacterial and mammalian sources (Bugos 
et a/., 1990- The sequence data of Imti have been deposited in the 
DDBJ/GenBanJc Nucleotide Sequence Databases under the accession 
number M87340. 

different reactions. £-scan analysis of experiments such as 
that shown in Figure 3 indicated that transcription of the Irml 
message is four to five times higher in leaves from salt- 
stressed plants than in leaves of unstressed control plants 
of the same age. This increase is comparable to, and perhaps 
greater than, the stress-induced increase in transcription rate 
observed for the Ppcl transcript (Figure 3, Cushman et ai , 
1989). 

Sequence analysis 

To gain insight into the possible biochemical and physio- 
logical functions of the IMTI protein, the sequence of the 
cDNA was determined (Figure 4). The clone was 1524 bp 
long. It contained an A+T-rich leader sequence and an ATG 
stan codon followed by an uninterrupted reading frame of 
1095 nucleotides. The 3' end consisted of a long non-coding 
region of 383 nucleotides, which included two possible 
polyadenylation recognition sequences (AATAA and 
AATAAA) located 28 and 74 bases, respectively, upstream 
of a 31 base poly(A) tail. Analysis of the Irml sequence 
predicted a hydrophilic polypeptide of 365 amino acids with 
a molecular mass of 40 250 (Figure 4). A search of the 
NBRF database revealed similarity to a bovine pineal gland 
hydroxyindole 0-methyItransferase (Ishida et aL, 1987). 
Homolosy (including conservative amino acid replacements) 
was 55% over the enure length. Significant identity (27%) 
was confined to 195 amino acids in the C-terminal portion 
of the protein (alignments not shown). The predicted IMTI 
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polypeptide was found to be even more closely related to 
two plant Afunctional hydroxymethyl transferases which 
methylate the lignin monomers caffeic acid and hydroxy- 
fenilic acid (Bugos etaL y 1991; Gowri etaL, 1991). 
Comparison with these proteins revealed >50% identity 
over the entire length of the IMT1 sequence (not shown). 
Regions of shared homology with J-adenosyl-L-methionine 
(SAM)-dependent hydroxymethyl transferases (OMTs) are 
underlined in Figure 4 (Bugos et a/., 1991). 

Expression of IMT1 in Escherichia coii and 
identification of activity 

Although sequence alignments suggested a general bio- 
chemical function for the IMT1 protein, amino acid identities 
were not sufficient for the identification of the enzyme's 
substrate specificity. We hypothesized that one possible role 
for a methyl transferase in the salt stress response in 
M.crysiallimun was in the biosynthesis of pinitol (Id-3-0- 
methyl duVo-inositol). Pinitol is a methylated cyclic sugar 
alcohol (cyclitol) that accumulates to high levels in salt- 
stressed M.crystallinwn (Paul and Cockburn, 1989). Its 
appearance and accumulation parallel that of the Imtl mRNA 
(Figure 1) during the initial days of salt stress (Paul and 
Cockburn, 1989). Pinitol has potentially osmoprotective 
properties and is thought to serve as an intracellular osmolyte 
(possibly in the chloroplasts) during osmotic adjustment in 
this inducible halophyte (see Discussion). Pinitol is derived 
from the six-carbon cyclic sugar alcohol myo-inositol 
(Figure 5). Its synthesis involves an initial methylation of 
myo-inositol by a SAM-dependent hydroxymethyl trans- 
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Fig. 5. Pathway of pinitol biosynthesis in angiosperms. Pinitol 
(lD-3-O-mcthyl chiro- inositol) is synthesized from myo-inositol, a six- 
carbon ring polyol derived from glucose-6-phosphate. The synthesis 
pathway consists of a methylation step catalyzed by a position-specific 
S-adenosyl-L-methionine (SAM)-dependem 0-methyl transferase 
(OMT), followed by an enzymeotaJyzed epimerization which may 
require NAD* and/or NADPH (Loewus and Dickinson, 1982). The 
methylated intermediate is ononitol. Vertical bars on ring structures 
represent hydroxy! groups. 



ferase, followed by an enzyme-catalyzed epimerization via 
a keto intermediate. In angiosperms, the pathway proceeds 
via ononitol (lD-4-O-methyl myo-inositol; Figure 5). An 
isomer of ononitol, sequoyitol (lr>5-0-methyl myo-inositol), 
is the methvlated intermediate in gymnosperms (Dirtrich and 
Brandl, 1987). 

To substantiate the hypothesized physiological role of the 
IMT1 protein in pinitol biosynthesis, the polypeptide was 
expressed in E.coli and bacterial lysates were tested for 
myo-inositol hydroxymethyl transferase activity. As is 
evident from Figure 6A, E.coli containing the clone in the 
proper orientation expressed a protein of molecular mass 
-40 kDa that was not present in control cells (containing 
the clone in the opposite orientation) that were grown under 
identical conditions. This protein co-migrated on SDS —poly- 
acrylamide gels with the reticulocyte system in vitro 
translation product of the in vz/rcKranscribed Imtl clone 
(Figure 6B). Expression ranged from -1% to > 10% of 
E.coli protein, depending on growth conditions. 

We tested E,coli extracts from control cells and cells 
expressing the IMT1 protein for myo-inositol-dependent O- 
methyl transferase activity in an assay using l4 CH 3 -labeied 
SAM as a methyl group donor. Carbohydrates from post- 
assay extracts were separated by HPLC and radioactive 
fractions identified by scintillation counting. A radioactive 
product with a retention time of - 11.1 min was generated 
by extracts from cells expressing the IMT1 protein, but not 
by control extracts (Figure 7). The appearance of this peak 
was dependent on the addition of myo-inositol to the assays. 

A B 



+ » 




Fig. 6. Expression of IMT1 in Lcoti. The imtl cDNA expression 
constructs were transformed into Lcoli BL21(DE3) cells as described 
in Materials and methods. (A) SDS-PAGE of proteins from Lcoii 
transformed wim lmil cloned 3*-5' (-) or 5'-3* (+) behind the 
biuescript T7 promoter. (B) SDS-PAGE of in viiro translation 
products of unprogrammed reticulocyte lysate translation system (-) or 
lysate programmed with in w/nHranscribed Imti RNA (+). All 
samples in panels A and B were run on the same gel. which was cut 
and either Coomassie stained (A) or fluorographed (B). Positions of 
size markers are labeled in kDa. Arrow indicates position of IMTI. 
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Two less prominent peaks were generated by all assays 
(regardless of myo-inositoi addition or IMT1 expression), 
suggesting the presence of endogenous SAM-dependent 
OMTs \n E.coli. The activity was sensitive to both heat 
(70°C 15 min) and proteinase K treatment, and I4 C 
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Fig. 7. Myo-inositoi Omethyl transferase assays of E.coli extracts. 
Equal amounts of total soluble protein from cells expressing IMT1 
were assayed with (A) or without (•) myo-inositoi. Control extracts 
from ceils transformed with only Bluescript (O) were assayed with 
myo-inositoi. [ u C]methyi-SAM was included as a methyl group 
donor. A portion (8%) of post-assay carbohydrates were resolved by 
HPLC as described in Materials and methods. I4 C d.p.m. of 0.5 min 
fractions from representative HPLC runs are shown. Inset: to define 
more precisely the retention time of the IMT1 product, 7.5 s fractions 
with retention times between 10.5 and 12.0 min were collected and 
analyzed. 



incorporation into the product was linear during the course 
of the assay (data not shown). These assays indicated that 
Irml encodes a SAM-dependent myo-inositoi 0-methyl 
transferase. 

The l4 C-labeled product of the assays on E.coli extracts 
was visible on HPLC traces as a distinct peak with a retention 
time of slightly under 11.1 min (Figure 8, panel 3) that was 
not present in assays from control extracts (Figure 8, panels 
1 and 2). To establish that the methylated myo-inositoi 
generated by the IMT1 protein was ononitol, the methylated 
intermediate in pinitol biosynthesis, its retention time was 
compared with that of methyl-myo- inositol standards. There 
are four methyl-myo-inositol isomers: sequoyitol, ononitol 
and d- and L-bornesitoI (Id- or lL-l-O-methyi rayo-inositol) 
(Loewus and Loewus, 1980). Only ononitol and sequoyitol 
are possible precursors for pinitol (Dittrich and Brandl, 1987; 
Figure 5), and only ononitol has been documented as the 
precursor to pinitol in angiosperms (Dittrich and Brandl, 
1987). Extracts from control EcoU (such as that shown in 
Figure 8, panel 1) were spiked with standards, processed 
as assays (see Materials and methods) and analyzed by 
HPLC. The retention times of sequoyitol and bornesitol were 
1 1.5 and 12.2 min respectively (data not shown). Ononitol, 
however, displayed a retention time identical to that of the 
reaction product (Figure 8, panel 4), indicating that the myo- 
inositol-dependent 0-methyi transferase encoded by the Irml 
mRNA does indeed generate the methylated intermediate in 
the biosynthesis of pinitol. 

IMT1 activity in plant extracts 

To verify that the induction of the Irml mRNA in stressed 
M.crystallinum is accompanied by an increase in IMT1 
expression, extracts from leaves of salt stressed (102 h) and 
unstressed M.crystalUnwn were assayed for IMT1 enzyme 
activity (Figure 9). Carbohydrates from post-assay plant 
extracts were resolved by HPLC, fractions were collected 



o 

to 
c 
o 
a. 

W 
CD 

DC 



a 
a> 

a 



J 




\ r 





n.1 




Relative Retention Time 



Fig. 8. Identification of the rr^yl-myo-inositol product of IMTI. Reparative HPLC traces of carbohydrates ^ P^^of^^^l 
shown: Panel 1: assay of extract confining IMTI but no myo-inositoi. Panel 2: assay £,<n myo-tnos.tol> of extract from ^5 n mo7 
with vector onlv. Panel 3: assay of exnac. containing IMTI protein plus myo-inositoi. Panel 4: ^ «mt *^*L^F?£Z i'roduT° 
of ononitol standard. Peak labels: G. glycerol added to protein extracts for storage: M. myo-mos.tol added to extrac* 

O. ononitol spike. Tilted arrows indicate the position of an endogenous Lcoli peak present m all extracts, usen.1 here as a marker. Retention ttmes 

of product and of ononitol standard are indicated. 
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Fig. 9. IMTI activity in stressed and unstressed M.crysiallinum. Crude 
soluble protein extracts from leaves of unstressed plants (unstressed) or 
plants stressed for 102 h (stressed) were assayed for myo-inositol 
OMT activity in assays similar to those run on Ecoii extracts (Figure ' 
7). Assays were run for 2 h. during which time l4 C incorporation into 
product was linear (data not shown). Proteins were extracted from leaf 
material pooled from three stressed or three unstressed plants. Activity 
values are the mean of three separate assays; standard deviations are 
shown. No activity was detected in assays carried out without myo- 
inositol (data not shown). 

and c.p.m. determined by scintillation counting. I4 C c.p.m. 
were detected in a single peak with a retention time of -11.1 
min and the appearance of 14 C in carbohydrate fractions 
was dependent on the addition of myo-inositol to assays (data 
not shown). Stressed plants exhibited a large ( — 60-fold) 
increase in IMTI activity over unstressed plants, which 
contained an almost undetectable level of activity 
(Figure 9). 14 C incorporation into product was linear during 
the course of the assay (data not shown). These assays 
indicated that Imtl mRNA induction results in increased 
IMTI activity in stressed-plant extracts, suggesting that the 
increased expression of this transcript is, at least in part, 
responsible for the concurrent accumulation of pinitol 
observed in salt-stressed M.crystallinum. 

Discussion 

Halophytic plants have developed mechanisms of osmo- 
regulation that allow them to avoid salinity-induced water 
stress. We have used a facultative haiophyte, M.crystailinum, 
to investigate the molecular basis of salt tolerance. When 
subjected to salinity stress, M.crystallinum undergoes a brief 
period of wilting followed by recovery and continued growth 
(Bohnen et a/., 1988; Winter and Gademann, 1991). In an 
anempt to elucidate the biochemical mechanisms responsible 
for this observed adjustment to stress, we generated a cDNA 
library enriched for transcripts induced during the initial 
hours of stress.* Differential screening of this library yielded 
a dramatically up- regulated mRNA, Imtl, encoding a 
polypeptide with homology to several methyl transferases. 



Expression of the protein in E.coli and activity assays 
identified the enzyme as a myo-inositol 0-methyI transferase. 
This enzyme methylates myo-inositol to produce ononitol, 
which is subsequently epimerized by a different enzyme to 
produce pinitol, a cyclic sugar alcohol known to accumulate 
to high levels in salt-stressed M.crystallinum (Paul and 
Cockburn, 1989). Enzyme assays performed on plant 
extracts established that the induction of Imtl mRNA is 
accompanied by the appearance of myo-inositol 0-methyl 
transferase activity in salt-stressed plants. To our knowledge, 
this study is the first example of differential screening of 
a cDNA library leading to the biochemical identification of 
a novel plant enzyme with a defined physiological function. 

Pinitol is one of a larger class of compounds, sugar 
alcohols, whose presence correlates with osmotic stress 
tolerance in a wide range of organisms. Cyclic sugar alcohols 
such as pinitol and straight-chain polyols, such as mannitol 
and sorbitol, are found in high levels in desiccation- and salt- 
tolerant bacteria, fungi and algae as well as in halophytic 
higher plants (Csonka and Hanson, 1991; Bieleski, 1982; 
Hellebust, 1976). Pinitol is the most common of the various 
myo-inositol-derived cyclitols found in plants (Dittrich and 
Korak, 1984). It is found at high levels in a diverse set of 
salt-tolerant plant species, including maritime pine (Nguyen 
and Lamant, 1988), several legumes (Gorham et aL, 1988), 
halophytic mangroves (Popp, 1984) and both dicots and 
monocots from maritime habitats (Gorham et aL , 1981). In 
M.crystallinum, pinitol can accumulate to >70% of total 
soluble carbohydrate (up to 10% of plant dry weight) 
following NaCl stress (Paul and Cockburn, 1989). It may 
reach intracellular concentrations of up to 300 mM. 
Localization studies by Paul and Cockburn (1989) suggest 
that >60% of the pinitol in stressed M.crystallinum is in 
the cytoplasm, with the rest residing in the chloroplasts. 

The exact physiological role of sugar alcohols during 
osmotic stress has not been determined. One view is that 
sugar alcohols are involved in osmoregulation: they 
accumulate to high levels to act as intracellular osmolytes 
that balance external osmotic pressure. In many osmotolerant 
fungi and algae, sugar alcohol levels are variable and 
correlate with the osmotic potential of the extracellular 
environment (Bieleski, 1982; Hellebust, 1976). In halophytic 
higher plants, sugar alcohols are thought to accumulate in 
the cytoplasm (and/or chloroplasts) to counteract the high 
osmotic potential of the vacuoles, which selectively 
accumulate salt to maintain turgor and sequester ions away 
from the cytosol, where they would be harmful to enzyme 
activities (McCue and Hanson, 1990; Pollard and Wyn 
Jones, 1979; Hellebust, 1976). A similar role has been 
proposed for other low molecular weight metabolites such 
as proline and glycine-betaine, which accumulate in- many 
environmentally-stressed plants (McCue and Hanson, 1990; 
Pollard and Wyn Jones, 1979). Compounds that accumulate 
to high levels to serve as osmolytes in the cytoplasm must 
be compatible with cellular function. They must be 
non-disruptive to enzyme structure and activity even when 
present in high concentration, and they must be non-reactive 
(Yancey etal. y 1982; Gorham */d/., 1981; Hellebust, 
1976). Also, osmolytes that accumulate to high intracellular 
concentration must be removed from the flux through major 
metabolic pathways. Metabolically inert, non-reactive 
cyclitols such as pinitol fit all of these criteria. 
Osmoregulation is only one of the roles suggested for sugar 
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alcohols in stress-toleram organisms. It has also been 
proposed that their primary purpose is one of macro- 
molecular osmoprotection (Schoben, 1977). In vitro studies 
indicate that these compounds may have hydrogen bonding 
characteristics that allow them to protect macromolecules 
from the adverse effects of excessive water loss, high 
temperature and increased ionic strength in the surrounding 
medium (Laurie and Stewart, 1990; Bieleski, 1982; Ahmad 
et a/., 1979), presumably by tightly associating with proteins 
and membrane components to replace water lost during 
desiccation (Yancey et ai. y 1982; Schoben, 1977). For this 
purpose, sugar alcohols are thought to be important in 
desiccation-tolerant organisms such as yeast (Coutinho et ai , 
1988)* and, possibly, the resurrection plant Craterostigma 
plantagineum (Bartels et ai., 1991). It is likely that sugar 
alcohols can carry out either osmoregulatory or osmopro- 
tective roles in the various organisms where they are found, 
depending on the level to which they accumulate, their 
intracellular location and the environmental stress being 
encountered. 

Although polyol accumulation has been correlated with 
osmotoierance in a number of plants, the mechanisms of 
accumulation have not been clear (Loewus and Dickinson, 
1982). It has been noted that the higher levels of cyclitols 
and other putative intracellular osmolytes seen in stressed 
or salt-tolerant plants could, rather than being a beneficial 
response, be a consequence of 'impaired metabolism' during 
stress (Wyn Jones and Gorham, 1983). The transcriptional 
induction of the myo-inositol OMT (Figure 3) during 
adjustment to saline conditions indicates that sugar alcohol 
accumulation in the ice plant is not an artifact of disturbed 
metabolism. Rather, it appears to be pan of a genetically 
determined, adaptive response that enables M.crystallinum 
to adjust physiologically to extreme osmotic stress. 

The IMT enzyme is one of four distinct 0-methyl 
transferases that methylate myo-inositol at specific hydroxyl 
positions to create ononitol, sequoyitol, or L- or D-bomesitol 
(Loewus and Loewus, 1980; Hoffmann-Ostenhoff et ai, 
1978). Although two of these enzymes (those that methylate 
myo-inositol to make l- and D-bornesitol) have been purified 
(Hoffman-Ostenhoff, 1978), none have been extensively 
biochemically characterized and no myo-inositol OMT 
sequences have been determined prior to this study. That 
these enzymes are highly specific for their site of methyla- 
tion is supported by the work of Hofl6nann-OstenhofiF(1978), 
as well as by our assay results, which generated only ononitol 
(Figures 7 and 8; and data not shown). Cloning and sequence 
analysis of other myo-inositoi OMTs, as well as more 
in-depth biochemical analysis of the ice plant OMT described 
here, would provide information on the comparative 
biochemistry and specificity of these enzymes. The over- 
expression of IMT1 in E.coli (Figure 6) is a first step for 
both of these endeavors. 

The transient expression of Imtl in roots (Figure 1) 
provided the. first indication that the Imtl transcript was 
involved in a non-organ-specific stress response distinct from 
the leaf-specific switch to CAM photosynthesis that occurs 
in salt-stressed M.crystallinum. However, the different 
patterns of Imtl expression in leaves and roots (Figure 1) 
and the much greater abundance of the transcript in leaf tissue 
may indicate that the cells in these organs have distinct 
mechanisms of osmoregulation or that the roots play a 
different role in the response to stress at the whole-plant 



level. Indeed, it has been noted in physiological studies that 
the root cortical cells of M.crystallinum do not fully regain 
turgor following NaCl stress, while leaves recover turgor 
after 4 days of stress (Winter and Gademann, 1991). The 
greater accumulation of Imtl mRNA in leaves reflects what 
has previously been observed for pinitol levels in the ice plant 
(Paul and Cockburn, 1989). Higher expression of Imtl in 
leaves may be indicative of a role for pinitol in chioroplast 
osmoprotection, although pinitol is not localized solely to 
these organelles (Paul and Cockburn, 1989). 

The salinity-induced accumulation of Imtl is due to 
transcriptional activation (Figure 3). Based on a number of 
genes whose patterns of expression have been studied in the 
ice plant, increased transcription.appears to be the primary 
mechanism of regulation for stress-induced genes (Figure 3; 
Vernon et at, 1991; Cushman et al. y 1989). This is true 
for both CAM-related mRNAs and those more directly 
involved in salt tolerance, such as Imtl. An understanding 
of how the plant's transcriptional machinery is cued by the 
environment is central to an understanding of environmental 
stress tolerance, and this topic is currently being investigated 
by the in-depth analysis of selected promoters. 

The transcriptional induction of a myo-inositol hydroxy- 
methyl transferase in this facultative haiophyte suggests that 
sugar alcohols play an important role in osmotic stress 
tolerance. Whether the exact function of these compounds 
is to serve as non-disruptive cytoplasmic osmolytes or to act 
as *osmoprotectants* of protein and membrane structure 
remains to be elucidated. Experiments utilizing trans- 
formation of glycophytes with the Imtl gene will address 
the question of whether production of cyclitols is sufficient 
for higher plant osmotoierance. If so, the ability of this 
methyl transferase to create a non-reducing, metabolicaily 
inert sugar alcohol from a ubiquitous plant substrate makes 
it an attractive enzyme for the creation of stress-resistant 
transgenic plants. 

Materials and methods 

Plant material 

M.crystallinum were grown from seed as described (Ostrem et aL. 1987) 
and either kept in soil or transferred 14 days after seeding to Hoaglancfs 
solution in aerated, light-tight containers for hydroponic growth. Piano were 
salt stressed when 6 weeks old (unless otherwise noted) with 500 mM Nad 
(Ostrem et at. 1987) or. for hydroponics, by addition of 400 mM Nad 
to Hoagland's solution. Unstressed controls for each experiment were grown 
alongside stressed plants and were harvested at the same time. To avoid 
potential ex p e rim e n tal variation due to diurnal fluctuation of RNA and protein 
levels, all harvests were carried out at the same time during the dark/light 
cycle (4 h before the end of the light period). Harvested^material for RNA 
extraction was frozen in liquid N 2 and stored at -70*C. 

Subtracted cDNA library construction and screening 
Leaf RNA was extracted and poly(A)* mRNA prepared as described'by 
Ostrem et cl. (1987). First-strand cDNAs were generated from poly(A) 
RNA from stressed plants using a kit (BRL, Gaithersburg. MD). Follow- 
ing first-strand synthesis, RNA was removed by base hydrolysis with 0.1 
M NaOH (70°C, 60 min) and single stranded (ss) cDNA was hybridized 
to a 5.5-fold excess of poly(A)* RNA from control plants using the 
protocol and conditions described (Briehl et aL % 1990). Hybridizations were 
carried out for 18 h. Single-stranded cDNA remaining after hybridization 
was isolated by differential elution from hydroxyapautc (Davis, 1986) and 
made double-stranded using cONA synthesis kit materials (BRL, Gaithers- 
burg, MD) with poly(dN) 6 random primer (Pharmacia Inc., Piscataway, 
NJ). Double-stranded cDNA was ligated to henu-phosphorylated EcoBl/Sod 
adaptors (Invitrogen inc.. San Diego, CA). ligated into XZapU (Siratagene 
Inc.. La Jolla. CA) and packaged using Gigapack in vitro packaging extracts 
(Stratagcne Inc.. La Jolla. CA). Phage were plated and screened by three 
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cvclcs of standard differential plaque hybridization (Maniaiis et aL , 1982). 
^P-labded ss cDNA probes used for screening were generated from 
poly (A)" RNA isolated from unstressed M.crystallinum and plants salt 
stressed for 10 h. DNA from selected plaques was rescued as Bluescript 
SK" phagemids (Stratagene Inc., La Jolla, CA) and used to transfect Ecoti 
XLl-Blue cells for future DNA analysis and manipulation. Cloned insert 
DNA was isolated (Maniatis et aL , 1982), excised by £coRl digestion and 
analyzed on agarose gels. 

RNA isolation. Northern analysis and transcription assays 
Tool RNA from leaves and roots of hydroporucally-grown plants was isolated 
as previously described (Ostrem et oL . 1987), except that the buffer used 
for root RNA extractions contained additional detergent: 2% SDS and 2% 
Triton X-100. Total RNA was resolved on formaldehyde -agarose gels and 
Northern blotted (Thomas, 1983). Blots were probed, washed at high 
smngency and visualized as previously described (Vernon etoL. 1988) using 
32 P-labeied, gei-purifted Imtl insert DNA as a probe. Nuclei isolations 
from fresh leaf tissue, in vitro run-on experiments, DNA slot blotting and 
hybridization of transcripts to DNA were carried out as described previously 
(Cushman et oL. 1989). The c.p.m. hybridized to filters were directly 
quantified using a ^-scanner (Betagen Inc., Waltham, MA). 

Gen* copy number determination /ino1N 
Nuclear genomic DNA was prepared as described by Bedbroolc (1981). 
4 fig of nuclear DNA was digested to completion with either £coRI, HindSH 
or Hindi resolved on 1% agarose gels and Southern blotted (Southern. 
1975). Blots were processed as described (Vernon et oL , 1988). P-labded 
probes were generated from gel-purified 5'- or 3 '-end restriction fragments 
of Imtl or from full-length insert. Genome copy-number equivalents were 
based on an estimated M.crysmiUnum haploid genome size of 390 Mbp 
(De Rocher et oL. 1990). Hybridization signals were quantified using a 0- 
scope (Betagen, Inc., Waltham, MA). 

Sequence analysis . 
The nucleotide sequence of both strands of the Imtl cDNA was obtained 
by dideoxy sequencing (Sanger ctaL. 1977) of suitable overlapping^/*// 
restriction fragments and full-length insert cloned into Bluescript KS* and 
KS* cloning vectors (Stratagene Inc., La Jolla, CA). IMT1 amino acid 
sequence and restriction sites were identified using the Mount- Conrad - 
Meyers program (Williams. 1988). Sequence comparisons to the NBRF 
protein sequence database were performed using the FASTA program 
(Pearson and Lipman, 1988). 

Expression of IMT1 in E.coft + 
The full IMT1 reading frame was cloned into Bluescript KS as a 
transcriptional fusion in both orientations behind the T7 polymerase promoter. 
Constructs were transformed into BL21(DE3) ceils (Studier et aL, 1987), 
which contain the gene encoding the T7 polymerase under the control of 
an isoprow^D-toosalactoside (IPTG>indudblc promoter. A fortuitously 
located AAGAG sequence in the 5' leader of the cDNA was predicted to 
act as an Ecoli ribosome binding site. IFTG was added to cultures at a 
final concentration of 0.5 mM 4 h before harvest. Protein expression was 
analyzed by SDS-PAGE on 10% acrylamide gels (Ostrem et oL. 1987). 
Samples were prepared by boiling aliquots of £. coU cultures for 3 min in 
an equal volume of SDS extraction buffer. 

In vitro transcription of the Imtl cDNA was carried out with T7 RNA 
polymerase (Stratagene Inc., La Jolla, CA). In vitro translation of the 
resulting transcript was accomplished using a reticulocyte lysaie system (BRL 
Inc., Gaithersburg, MD). [ 3 %]rnethionine-labcied translation product was 
resolved by SDS-PAGE alongside Ecoli protein samples and visualized 
by fluorography. 

Soluble protein extraction and activity determination 
Soluble protein from Ecoli transformed with the IMT1 expression construct 
or native Bluescript KS* was extracted from 20 -200 ml cultures pelleted 
at 2500 £ for 10 min and resuspended in methyl transferase extraction buffer 
(MTEB): 100 mM Tris-Cl pH 8, 10 mM EDTA and 0-mercaptoethanol; 
I ml/20 mi of culture). Cells were lysed by sonication and extracts were 
clarified by centriruzanon at 10 000 g for 20 min. Total protein concentration 
was determined by'the method of Ghosh et oL (1988). Supematants were 
either used immediately for assays or stored at -70°C with 5% glycerol. 

Myo-inositoi O-methyl transferase assays on Ecoli extracts were earned 
out in a 200 id volume containing 1.0 mg total soluble protein, 50 mM 
Tris-Q pH 8.0. 40 mM MgCK and 1.0 mM myo- inositol. Assays were 
pre-incubated at 30°C for 5 min and initiated by the addition of 5-adenosyl- 
L-mcthjonme (SAM) to a final corcentration of 0.5 mM. SAM stock solution 
contained unlabeled SAM (Sigma Chemicals St Louis, .MO) and C- 



labeled SAM (ICN Biochemicals, Irvine. CA) at a 50: 1 ratio. Assays were 
carried out at 30*C for 60 min and terminated by transfer to ice and 
chloroform extraction. The aqueous phase was subjected to further processing 
and HPLC analysis (see below). 

Total soluble plant protein was extracted as described by Ostrem et aL 
(1987) using MTEB plus 2 mM leupeptin as extraction buffer. Assays were 
carried out on crude leaf extracts as with Ecoli extracts, except that myo- 
inositol was included at a final concentration of 5 mM and assays were 
run for 2 h. Assays were terminated with phenol —chloroform (1:1) extraction 
followed by a chloroform extraction. 

HPLC analysis of assay products 

Samples were prepared for HPLC by extraction with 2 vol meihanol/chloro- 
form/ water (12:5:3) followed by addition of 0.4 ml water. A desalting column 
of AG50WX4 (Bio-Rad. Richmond, CA) and Amberiite IRA-68 (Sigma 
Chemicals. St Louis, MO) in OH-form was used to desalt extracts and 
remove charged species (including remaining ( 14 C]SAM). Samples were 
dried, dissolved in deionized water and filtered through a nylon Acrodisc 
13 (Gelman, Ann Arbor, MI). Equal amounts of dissolved carbohydrates 
from each assay were resolved on a 300 x 7.8 mm HPX87C calcium-form 
ligand exchange column (Bio-Rad. Richmond, CA) at 85 °C with a 0.6 
ml/min flow rate using degassed, deionized water as an eluent. Post-column 
NaOH was added at 0.3 M, 0.6 ml/min. and traces were obtained using 
a pulsed amperometric detector (Dionex, Sunnyvale, CA) at 35°C and a 
Specoophysics SP4290 integrator (Spectrophysics Analytical, San Jose, CA). 
Fractions were collected at 7.5 s or 0 JS min intervals and scintillation counted. 
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Abstract 

LEA proteins are late embryogenesis abundant in the seeds of many higher plants and are probably 
universal in occurrence in plant seeds. LEA mRNAs and proteins can be induced to appear at other stages 
in the plant's life by desiccation stress and/or treatment with the plant hormone abscisic acid (ABA). A 
role in protecting plant structures during water loss is likely for these proteins, with ABA functioning in 
the stress transduction process. Presented here are conserved tracts of amino acid sequence among LEA 
proteins from several species that may represent domains functionally important in desiccation protection. 
Curiously, an 1 1 amino acid sequence motif is found tandemly repeated in a group of LEA proteins of 
vastly different sizes. Analysis of this motif suggests that it exists as an amphiphilic a helix which may 
serve as the basis for higher order structure. 
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Introduction 

The amino acid sequences for a number of LEA 
proteins have been deduced in the past year from 
cDNA and/or genomic DNA sequences. These 
include 6 proteins from cotton [ 1 ], 2 for carrot [3, 
Seffens and Thomas, in preparation] and 1 each 
from barley [8], rice [12], rape [7] and wheat 
[ 10]. All of these proteins and their mRNAs accu- 
mulate in embryo tissues of seeds as they 



approach maturity and begin to desiccate. The 
LEA mRNAs are not readily detected in leaves or 
roots of non-stressed plants nor in young imma- 
ture seed tissues. The cotton LEA mRNAs can be 
induced to high levels prematurely by treating 
immature embryos with ABA [6]. The mRNA for 
the LEA protein of barley, rice and wheat can be 
induced to reappear in germinating seedlings by 
treatment with ABA [8, 12, 2]. Severe desiccation 
of leaves induces the rape LEA mRNA [7]. Thus 
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Table 1. LEA proteins of known sequence. 



Species 


Derivation 


Amino acids 


xvksji. wi. 


Clone name 


Ket. 


Group 1 












wheat 


enmnlptf* rDNA 


93 




Em protein 


[10] 


cotton 


- comolete cDNA 


120 


11 1 Sfl 

i j ijyj 


n 10 


rn 


Group 2 












rice 


genomic DNA 


163 


16526 


rab21 


T121 


cotton 


complete cDNA 


145 


15793 


D 11 


[1] 


Group 3 












cotton 


composite cDNA/ 


136 


14597 


. D7 


[1] 




genomic DNA 










carrot (I) 


complete cDNA 


163 


16989 


Dc-3 


[ + ] 


barley 


complete cDNA 


213 


21803 


pHVal 


[8] 


rape 


complete cDNA 


280 


30360 


pLEA 76 


[7] 


carrot (II) 


incomplete cDNA 


323* 


66000** 


Dc-8 


[3] 



[ + ] Sefifens and Thomas, in preparation 
* Of sequenced fragment. 
** From gel electrophoresis. 
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there may be a functional linkage between 
desiccation stress, ABA and LEA proteins. The 
amino acid sequences of these LEA proteins from 
the different species have been compared by dot 
matrix analysis and three of the cotton LEA pro- 
teins were found to share regions of homology 
with proteins from the other plant families. These 
groupings are: cotton LEA D 19 and the wheat 
Em protein; cotton LEA D 11 and the rice pro- 
tein rab 21 ; cotton LEA D 7 and a LEA protein 
from barley, rape and two from carrot. These 
groups are listed in Table 1. The carrot sequences, 
although sharing regions of amino acid homology, 
represent 2 distinctly different proteins and are 
referred to as carrot I and II. The regions of 
homology among the Group 3 LEA proteins in- 
volve a repeating tract of 11 amino acids that 
possibly exists in an a helix with twisting amphi- 
philic surfaces. 



Dot matrix analysis 

The sequence comparisons were carried out using 
the computer program DSAS 2 created by J. Nei- 
gel, Southwestern Louisiana University. The 
stringency for the comparisons between different 



proteins required that 7 of 1 1 amino acids match 
to generate a dot, but allowed synonym amino 
acids (having functionally similar R groups) to be 
considered identical. These groupings are: 
Asp = Glu, Asn = Gin, Arg = Lys, Ser = Thr, 
Phe - Tyr = Trp, Ala = Val = He = Leu = 
Met. These groupings reduce the number of amino 
acids from 20 to 10, but requiring a 7 of 1 1 match 
predicts a random match only once in over 
200000 11 amino acid comparisons. Thus ran- 
dom background dots are extremely rare. 

Such a calculation of background random dots 
assumes that amino acid of usage is random, 
which is not true. However, the tracts of sequence 
homology observed in these comparisons are 
clearly meaningful, and denote the conservation 
of ancestral sequence motifs. 

Where a protein sequence is compared with 
itself to visualize repeated amino acid sequences, 
the stringency was raised to require an 8 of 11 
match. 

Tracts of nucleotide sequence homology 
between cDNAs/genomic DNAs for these pro- 
teins is not readily discernible in all cases by dot 
matrix analysis. In order to generate diagonal 
tracts of dots, the stringency of the comparisons 
must be lowered to the point that considerable 
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random background dots are generated. In the 
two instances where enough homology exists to 
allow background to be reduced, the generation of 
a dot required that 9 of 12 nucleotide match. This 
predicts the likelihood of a random background 
dot to be roughly 1 in 3000 twelve nucleotide 
comaprisons. 



Cotton/wheat LEA proteins 

Dot matrix amino acid sequence comparison 
between the cotton LEA D 19 protein and the Em 
protein of wheat is shown in Fig 1 (top right). It 
is apparent from this figure that these proteins 
have not diverged greatly in the > 100 million 
years that separates the evolution of cotton and 
wheat. In Fig. 2 (top) the sequences of these two 
proteins are aligned. The cotton protein is seen to 
have a few additional amino acids near the N 
terminus and an additional 18 amino acids at the 



C terminus not found in the wheat protein. Of the 
93 amino acids aligned, 65 are identical. Of the 28 
that are different, 9 involve differences between 
synonyms. Neither protein contains Cys or Trp. 
No structural domains such as regions of amphi- 
philic a helices are suggested by the sequences of 
these two proteins. Fig. 1 (top left) compares the 
nucleotide sequences of the cDNAs from which 
the amino acid sequences were derived. Frag- 
ments of a diagonal line of homology are visible, 
indicating that high nucleotide homology is limit- 
ed to a few scattered regions. 



Cotton/rice LEA proteins 

Fig. 1 (bottom right and left) gives the dot matrix 
comparison of amino acids and protein coding 
nucleotides respectively for the cotton LEA pro- 
tein D 11 and the rab21 protein from rice 
embryos. It is apparent from these plots that 
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Fig, 1. Dot matrix analyses of amino acid sequences (right) and nucleotide sequences (left) of the Group 1 (top) and 2 (bottom) 
LEA proteins. Tic marks along the x and y axes represent 20 amino acids or nucleotides. In the amino acid comparison, a match 
of 7 in a span of 1 1 yields a dot. Functionally similar amino acids are treated as identical (see text). In the nucleotide comparisons, 

a match of 9 in a span of 12 yields a dot. 
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COMPARISON OF THE WHEAT LEA PROTEIN WITH THE COTTON D 19 PROTEIN 



cotton 1 MET ALA SER GLU GLN TYR GLN ALA MET ARG ASN ALA PRO GLN GLU GLU LYS GLU GLU LEU 

wheat 1 gly ARG ser gin 

21 ASP ALA ARG ALA LYS GLN GLY GLU THR VAL VAL PRO GLY GLY THR ARG GLY LYS SER LEU 

12 arg LYS ARG glu gly 

41- ASP ALA GLN ILE ASN LEU ALA GLU GLY ARG HIS LYS GLY GLY GLU THR ARG LYS GLN GLN 

32 GLU glu ser ARG gin ARG glu 

61 LEU GLY THR GLU GLU TYR GLN GLU MET GLY- ARG LYS GLY GLY LEU SER ASN SER ASP MET 

52 MET glu ser gin thr asn glu 

81 SER GLY GLY GLU ARG ALA ALA ASP GLU GLY VAL THR ILE ASP GLU SER LYS PHE ARG THR 
72 ASP arg ILE asp LYS 

101 LYS LYS LEU ASN ILE ILE SER HIS ASP ARG THR LEU ALA THR PRO PHE ILE TYR LYS PRO 

92 ser • •*.• ••• 



COMPARISON OF THE RICE LEA PROTEIN WITH THE COTTON 0 11 PROTEIN 



rice 1 MET GLU HIS ... GLN GLY GLN HIS GLY HIS VAL 
cotton 1 MET ala HIS phe GLN asn GLN tyr ser ala pro 



11 
21 



thr 


gin thr 
+ 


asp 
* 


ala 
* 


GLY 
pro 


thrIgly 

THRjgin 


ALA 
glu 


GLY 
thr 


GLY 
his 


THR GLY 
gly leu 


ARG 
his 


GLN 
arg 


GLU 
GLU 


ASP ASP 
ASP GLU 


GLY 
GLY 


MET 
thr 


* 

LEU 
ILE 


PRO GLY 
PRO GLY 


GLY 


ASN 
ASN 


GLY 


THR GLY 


THR 


GLY 



29 GLN MET GLY THR ALA GLY MET GLY THR HIS GLY THR 
41 gly ILE leu gly ILE GLY gly his his HIS GLY gly 

49 VAL LEU GLN ARG f SER GLY SER SER SER SER SER SER 
61 [SER SER SER SER SER SER SER SER 

69 ARG ARG LYS LYS GLY ILE LYS GLU LYS ILE LYS GLU 
76 LYS LYS LYS LYS GLY LEU LYS GLU ARG LEU LYS GLU 

89 GLU GLN GLN HIS ALA MET GLY GLY THR GLY THR GLY 

109 GLY ALA TYR GLY GLN GLN GLY[HiS GLY THR GLY MET THR THR GLY THR THRlGLY ALA HIS 
y5 [HIS gin SER gin ALA THR SEK thr THR THR] pro gly gin 

129 GLY THR THR THR THR ASP THR GLY GLU LYS LYS GLY ILE MET ASP LYS ILE LYS GLU LYS 
108 GLY pro THR tyr his gin his his arg glu glu arg ser asp gly gin gly glu ala pro 

149 LEU PRO GLY GLN HIS 

128 trp ser pre GLN pro leu lie ser cys leu trp ser ala ile ser tyr 

Fig. 2. Sequence alignment of the Group 1 (top) and 2 (bottom) LEA proteins. Missing amino acids (gaps) are indicated with 
dots. In the top alignment, amino acids in the wheat protein that differ from those of the cotton protein appear in lower case 
letters. Those that differ but represent synonym functions appear in upper case letters. In the bottom alignment identical or 
synonym amino acids appear in upper case letters, mismatched amino acids in lower case letters. Region of high homology are 
bracketed, sequences repeated in the cotton protein are underscored with asterisks, those repeated in the rice protein are 
overscored with asterisks. The tract of contiguous Ser residues is overlined. 



homologous tracts exist only in portions of these 
proteins. In Fig. 2 (bottom) these amino acid 
sequences have been aligned by eye. The regions 
of homology, seen as diagonals in Fig. 1, are 
bracketed. The cotton protein has a short se- 



quence occurring twice (6 of 7 amino acids) near 
the N terminus that is underscored by asterisks in 
the figure. It appears once in the rice protein as 
part of the first region of homology. The second 
region of homology is curious in that it begins with 
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a tract of 9 contiguous Ser residues in the cotton 
protein, 8 of 9 in the rice protein (overlined in the 
figure). Also within this second region of 
homology is a stretch of 14 amino acids that 
occurs again in the rice protein, overscored with 
asterisks. Both proteins are very high in Gly and 
hydroxyl-bearing amino acids, the rice protein 
containing a tract of 6 contiguous Thr-Gly pairs. 
The high Gly content would seem to preclude 
extensive a helical domains in either protein. 



Cotton/barley/carrot/rape LEA proteins 

Five other LEA proteins from 4 plant species 
(cotton, carrot, barley and rape) ranging in size 
from roughly 14.5 to over 60 kD have regions of 
sequence homology. Each of these proteins 
contain repeated amino acid tracts. This is shown 
in Fig. 3 (left column) which gives the dot matrix 
analysis of each protein compared with itself. The 
repeated sequences occur principally in the N 
terminal 1/2 of these proteins. The carrot II 
sequence used here is from an incomplete cDNA 
sequence; further data suggest that it represents 
the N terminal half of the carrot protein as well 
(Franz et al., in preparation). The number of the 
repeats in each protein varies considerably as 
does the size of the proteins. The pair wise 
sequence comparison of all 5 of these proteins, 
shown in the remainder of Fig. 3, indicates that 
the repeated sequences are shared among these 
proteins. Regions of homology observed between 
the carrot II protein and the cotton, rape and even 
the carrot I protein are very limited, but, as will be 
seen, the repeated sequence of the carrot II pro- 
tein has diverged considerably from the consensus 
repeated sequence found in the other proteins. 

Figure 4 gives the amino acid sequence of the 
regions carrying the repeat in the 5 proteins 
beginning with the smallest region, containing 5 
repeat units, which is found in the cotton D 7 
protein. The repeating unit is seen to be a tract of 
11 amino acids that is repeated contiguously. 
Clearly there is considerable degeneracy in the 
repeat within a single protein as well as between 
proteins. In two instances (barley and carrot I 



protein) the continuity of the repeats is inter- 
rupted by tracts seemingly unrelated to the repeat 
sequences. Curiously, in 2 of the 3 such instances, 
the intervening tract is 11 amino acids. Further, 
amino acids of the repeating unit in some in- 
stances have been lost, creating gaps in the unit 
(barley, rape and carrot II). 

Scrutiny of the repeating units of the rape and 
carrot II proteins suggests that during the evolu- 
tion of these proteins the original 11-mer repeat 
unit was duplicated once, followed by divergence 
of each of the 2 units. Subsequently the 22 amino 
acid unit was duplicated several times, i.e., two 
somewhat different 11-mer repeats can be dis- 
cerned in these proteins that occur in an alter- 
nating pattern. Thus these sequences are present- 
ed in Fig. 4 in tracts of 22, spaced to indicate the 
2 11-mer units. 

Despite the degeneracies listed above, it seems 
likely that these 5 proteins have a common 
heritage based upon the 11-mer motif and that 
this conserved motif is important in function. 

The salient chemical features of the 11-mer 
motif become apparent when the consensus 
chemical property of the amino acids occurring at 
each of the 11 positions is determined by sum- 
mation which is presented in Fig. 5. Since the 
11-mer motif in the carrot II protein is divergent 
from the other 4, it is treated independently in this 
summation. The summation shows that in po- 
sitions 1, 2, 4, 5 and 9, Ala and Thr predominate. 
Since these 2 amino acids appear interchangeable 
in these positions, the methyl function of these 
residues must be required. Positions 3, 7 and 11 
are occupied principally by either a negatively 
charged residue or Gin and thus can be con- 
sidered to require a very polar R group. Positions 
6 and 8 prefer positively charged residues. In 
position 10 there seems to be no function required, 
12 different amino acids appearing in the 3 1 repeat 
units examined. From this, the functional consen- 
sus shown in Fig. 5 was arrived at for these 4 LEA 
proteins and the frequency of occurrence in each 
position given in the figure. 

The carrot II protein summation reveals that 
the functional consensus for positions 2, 5, 6, 7, 
9 and 1 1 are the same as for other LEA proteins 




Fig, 3. Dot matrix analyses of Group 3 LEA proteins. In the left column the amino acid sequence of each protein is compared 
with itself, a match of 8 of 1 1 residues yields a dot. In the pair wise comparisons 7 of 1 1 residues must match to yield a dot 
and synonym amino acids are considered identical (see text). Tic marks represent 20 amino acids. 



REPEATED MOTIF IN LEA PROTEINS FROM COTTON, CARROT, BARLEY AND RAPE 



cotton 0 7 Lea 

31 LYS ALA GLU ALA ALA LYS GLN LYS THR MET GLU 
42 THR ALA GLU ALA ALA LYS GLN LYS THR MET GLU 
53 THR ALA GLU ALA ALA LYS GLN LYS THR ARG GLY 
64 ALA ALA GLU THR THR ASN ASP LYS THR LYS GLN 
75 THR ALA GLY ALA ALA ARG GLY LYS ALA GLU GLU 



carrot I Lea 

32 LYS ALA GLN ALA ALA LYS ASP LYS ALA SER GLU 

43 MET ALA GLY SER ALA ARG ASP ARG THR VAL GLU ser lys asp gin thr gly ser tyr val 

65 LYS ALA GLY ALA VAL LYS ASP LYS THR CYS GLU 

76 THR ALA GLN ALA ALA LYS GLU LYS THR GLY GLY 

87 ALA MET GLN ALA THR LYS GLU LYS ALA SER GLU met gly qlu ser ala lys glu 

105 THR ALA VAL ALA GLY LYS GLU LYS THR GLY GLY 



barley Lea 

26 MET MET GLY ALA THR LYS GLN GLY ALA GLY GLN 

37 THR THR GLU ALA THR LYS GLN LYS ALA GLY GLU 

48 THR ALA GLU ALA THR LYS GLN LYS THR GLY GLU 

59 THR ALA GLU ALA ALA LYS GLN LYS ALA ALA GLU 

70 ALA LYS ASP LYS THR ALA GLN 

77 THR ALA GLN ALA ALA LYS ASP LYS THR TYR GLU 

88 THR ALA GLN ALA ALA LYS GLU ARG ALA ALA GLN gly lys asp gin thr gly ser ala leu gly glu 

101 LYS THR GLU ALA ALA LYS GLN LYS ALA ALA GLU 

121 THR THR GLU ALA ALA LYS GLN LYS ALA ALA GLU 

132 ALA THR GLU ALA ALA LYS GLN LYS ALA SER ASP 



rape Lea 

31 LYS ALA GLU GLU GLY LYS ASP LYS THR SER GLN 

53 THR ALA GLN ALA ALA LYS ASP LYS THR SER GLN 

75 THR ALA GLN ALA ALA LYS ASP LYS THR SER GLN 

97 THR ALA GLN ALA ALA LYS ASP LYS THR SER GLN 

119 THR THR GLN SER SER LYS GLU LYS THR SER GLN 

141 THR LYS ASP LYS THR 



THR ALA GLN LYS ALA GLN GLN LYS ALA GLN GLU 

ALA ALA GLN THR THR GLN GLN LYS ALA GLN GLU 

ALA ALA GLN THR THR GLN GLN LYS ALA GLN GLU 

ALA ALA GLN THR THR GLN GLN LYS ALA HIS GLU 

ALA ALA GLN THR ALA GLN GLU LYS ALA ARG GLU 



carrot II Lea 

2 LYS ALA LYS MET ALA LYS ASP THR THR MET GLY 

24 LYS ALA GLU GLU ALA LYS GLU LYS ALA ALA GLN 

35 LYS ALA GLU GLU THR LYS GLU 

53 LYS ALA GLY GLU ALA LYS ASP THR THR LEU GLY 

75 LYS ALA ALA GLU ALA LYS ASP THR THR ALA GLN 

86 LYS ALA ALA GLU ALA LYS GLU 

104 LYS ALA ALA GLU ALA LYS VAL LEU ALA ALA GLN 

115 LYS ALA ALA GLU ALA LYS ASP THR THR GLY 

136 LYS ALA ALA GLU ALA LYS ASP ALA THR MET GLN 

158 LYS THR ALA GLU THR LYS ASP ALA THR MET GLU 

180 LYS ALA ALA GLU ALA LYS ASP ALA THR MET GLN 

202 LYS ALA ALA GLU THR LYS ASP ALA THR MET GLN 

224 LYS ALA ALA GLU THR LYS ASP ALA THR MET GLU 

235 LYS ALA LYS GLU ALA LYS ASP THR THR VAL GLN 

257 LYS ALA LYS GLU GLY LYS ASP VAL THR VAL GLU 

268 LYS ALA LYS GLU GLY LYS ASP THR THR VAL GLY 
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Fig. 4. Amino acid sequences of those regions of the Group 3 LEA proteins that contain the 1 1-mer repeating unit. Tracts 
showing no homology with the repeating unit extend to the right and are in lower case letters. Gaps in the repeating unit are 

left blank. 
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SUMMATION OF AMINO ACIDS IN REPEATING UNIT 



Position 1 2 

(all except carrot II) 

THR 17 ALA 23 



ALA 
LYS 
MET 



methyl 
24/31 



THR 
MET 



methyl 
28/30 



GLN 14 
GLU 11 
GLY 4 
VAL 1 



polar 
25/30 



ALA 21 



THR 
SER 
LYS 
GLU 



methyl 
26/30 



ALA 19 
THR 8 
GLY 2 
SER 1 
VAL 1 



methyl 
27/31 



LYS 24 



GLN 
ARG 
ASM 



GLN 14 
ASP 11 
GLU 6 
GLY 1 



positive polar 
26/32 31/32 



LYS 29 



ARG 
GLY 



THR 17 
ALA 15 



positive methyl 
31/32' 32/32 



10 



GLY 
ALA 
GLN 
MET 
ARG 



HIS, LYS 
GLU, VAL 
TYR, CYS 
1 each 



11 



SER 8 GLU 18 



GLN 
GLY 
ASP 



polar 
28/31 



(carrot II) 

LYS 26 



ALA 19 
THR 5 
ASP 1 
PRO 1 



positive methyl 
26/26 24/26 



ALA 9 

GLY 8 

LYS 6 

GLU 2 

VAL 1 



GLU 24 



SER 
MET 



ALA 10 

TYR 9 

THR 4 

GLY 2 

ASN 1 



negative methyl 
24/26 14/26 



LYS 26 ASP 21 



GLU 
VAL 



THR 8 

TYR 8 

ALA 5 

VAL 1 

LEU 1. 

LYS 1 



positive negative apolar 
26/26 25/26 23/24 



THR 15 
ALA 8 
SER 1 



methyl 
23/24 



ALA 13 
MET 6 
VAL 3 
LEU 1 
GLY 1 



apolar 
23/24 



GLN 16 
GLU 4 
GLY 3 



polar 
20/23 



Fig. 5. The occurrence of amino acids in each position of the 1 1-mer repeat unit has been collated for 4 of the Group 3 LEA 
proteins (top half) and for the carrot II LEA protein (bottom half). The consensus chemical function for each position is shown 
as is the number of times this function appears in this position compared to the total number of repeating units analyzed at this 

position. 



of this group. However, positions 1, 4, and 8 are 
reversed in hydrophilicity/hydrophobicity from 
the others. Position 3 in the carrot II protein does 
not seem to require a specific functional property 
whereas position 10 clearly requires an apolar 
moiety. 



Hydrophilicity of the LEA proteins 

Figure 6 presents hydropathy plots of all 9 of the 
LEA proteins calculated according to the amino 
acid values of Kyte and Doolittle [9]. In the plots 
of the proteins carrying the 1 1-mer repeats (right 
column), the regions containing the repeat are 
indicated by a horizontal bar. The predominant 
feature of all these plots is the overriding hydro- 
philicity of the proteins. The single region in the 
carrot II partial protein that is hydrophobic is due 
to the sequence Val-Leu-Ala-Ala occurring in one 
of the repeating units (underlined in Fig. 4). 
Although the calculation of the hydrophilicity of 



a span of amino acids based on the additivity of 
the properties of free amino acids may be 
questioned [13], the virtually unbroken hydro- 
philicity of these proteins makes it unlikely that 
they span membranes or form globular structures 
with buried hydrophobic cores. The fact that all 
the cotton LEA proteins (identified by hybrid- 
arrested translation and 2D electrophoresis [6] 
are easily solublized by homogenizing tissue in 
0.1 M NaCl pH 7.0 [5] predicts that they are not 
elements of structure nor compartmentalized in 
cells. 



Possible structure of the repeating unit 

The multiple and contiguous occurrence of a tract 
of amino acids with a periodicity of 1 1 suggests 
that this arrangement allows for the formation of 
a secondary and tertiary structure that may be 
necessary for protein function. An obvious possi- 
bility is the a helix, perhaps with amphiphilic 
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Fig. 6. Hydropathy analyses of the LEA proteins calculated as given in Baker et al [1]. Tic marks along the left border represent 
9 hydropathy units. Those values below the zero line are negative, shaded and hydrophilic. The bottom tic marks represent 10 
amino acids. The amino acid span for each value is 9. Groups 1 and 2 LEA proteins are in the left column; Group 3 in the right 
column where regions of the proteins carrying the 11-mer repeating unit are indicated with a bar. 
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properties. The absence of Pro, the paucity of Gly 
and the interspersion of polar and apolar residues 
in the repeat unit point to this possibility. Figure 7 
shows the 'helical wheel' diagram [14] for an a 
helix of 54 amino acids (15 turns) composed of 5 
repeating units of the 1 1-mer motif consensus for 
4 of this group of LEA proteins (A) and consen- 
sus for the carrot II LEA protein (B). In both 
wheels the apolar residues are shaded, the polar 
or charged residues indicated and the positions of 
no apparent specificity left blank. In both cases 



apolar and polar residues appear concentrated on 
a separate face of the helix, conveying an amphi- 
philic property to the helix; yet a small twist is 
given to each face resulting from the 1 1 residue 
periodicity. Figure 8 presents the same 55 residue 
helices in the 'helical net' diagram [4]. In A, the 
methyl function residues (which are the total 
apolar residues of this consensus sequence) are 
circled. In B, the methyl function residues are 





Fig. 7. Helical wheel diagrams of the 1 1-mer repeating unit 
found in the Group 3 LEA proteins. 54 amino acids (15 turns 
of the a helix) representing 5 contiguous repeating units are 
shown. The chemical functions of the amino acid R groups 
are shown as: apolar, shaded; positive and negative as 
indicated; unspecified positions left blank. A represents the 
consensus repeating unit for 4 of the Group 3 proteins and 
B that for the carrot II protein. 
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Fig. 8. Helical net diagrams of the 1 1-mer repeating units of 
the Group 3 LEA proteins. A represents the repeating units 
of 4 of this group and B that of the carrot II protein. The N 
terminus of the repeating unit is at the top. Amino acids with 
the methyl group function are circled. In B, other apolar 
amino acid functions are underlined and possible salt bridged 
residues joined with an elipse. The methyl spines found in 
both diagrams are linked with a line. 
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circled and the other apolar residues are under- 
lined. The B helix shows the 5 potential salt 
bridges between positive [1, 12, 23, 34 and 45] 
and negative [4, 15, 26, 37 and 48] residues whose 
formation might stabilize such a helix [11]. Salt 
bridges between the other oppositely charged 
residues of this helix, such as 6 and 7, are not 
likely. What is common to both helices is a slightly 
twisting spine of methyl groups parallel to the 
helix axis. Intramolecular or, more interestingly, 
intermolecular docking of these helices, driven by 
the apolar spine, to form tertiary filamentous 
structures is a distinct possibility. 



Discussion 

The desiccation of plant seeds is characterized by 
the accumulation of the LEA proteins which are 
not found abundantly in other organs of the plant 
nor in embryonic tissues of seeds at earlier stages 
of embryogenesis. Yet, where tested, the mRNAs 
for many of the LEA proteins (and presumably 
the proteins themselves) can be induced to detect- 
able levels in other organs or in seed tissues at 
other times in development by water stress and/or 
exposure to exogenous ABA. Desiccation stress 
is a normal aspect of seed formation, yet dan- 
gerous at other phases of a plant's life cycle. Thus, 
if LEA proteins, or a subset of them, are a part of 
a water stress response system, the system is a 
normal part of the seed developmental program, 
but on-call in tissues at other times in the life 
cycle. We suggest that such a subset of LEA 
proteins, once proven to function in the stress 
response, be named WSPs, mnemonic for water 
stress proteins, analogous to HSPs (heat shock 
proteins). 

The tracts of amino acids found to be 
homologous among proteins of each of the 3 
groups compared here may lead to the recognition 
of common secondary or tertiary structures as 
more LEA proteins are compared. Ultimately 
their function in protecting plant tissues in the 
dessicated or water-stressed state may become 
apparent from these structures. Towards this end, 
a function in coping with desiccation has been 



proposed for several other cotton LEA proteins 
[1] for which no homologous counterparts have 
been found in other plants to date. 

More immediately, the reality of the postulated 
a helices of the group 3 LEA proteins based on the 
11-mer repeated motif awaits demonstration by 
CD spectra and the X-ray crystallographic 
analysis of the purified proteins. 
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standard is in, or very close to. secular equilibrium 

fOr 230 Th/ 238 U and 234U/238U ratios 

8. MSWD (mean square of weighted deviates) - 
0.54. . 

9. MSWD = 2.0. 

10. For a 238 U half-life of 4.4683 x 10 9 yei 
Jaffey, K. R. Flynn, L E. Glendenin, W. C 
A. M. Essling, Phys. Rev. C4, 1889 (19 

11. P. de Bievre et ah, in Proceedings of th 
tional Conference on Chemical and Nuc 
Measurement and Applications, M. L. Hi 
(Institute of Civil Engineers, London, 1: 



221-225. Uncertainty given at 3a level. 

12. I. J. Winograd. B. J. Szabo, T. B. Coplen A C 
Fliggs, Science 242, 1275 (1988). 

13. J. W. Meadows, R. J. Armani, E, L Callis, A M 
Essling, Phys. Rev. C22, 750 (1980). 

14. For all samples, systematic age errors arising 
from uncertainty in the z30 Th half-life (13) are 
smaller than those from analytical errors (because 
of the error-demagnification effect of normalizing 
to the isotopic ratios of the secular-equilibrium 
standard). For those samples where 230 Th ages 
are preferred over 23A \J ages, the worst-case age 
error from 230 Th decay-constant uncertainty is 
1 1 50 years (for the 235.9-ka samples at 11 1 mm) , 
compared to the analytical error for that sample of 
3100 years. Errors from 230 Th decay-constant 
uncertainties for all other samples <400 ka are 
less; errors for 400- to 600-ka samples rise expo- 
nentially from -0 to 10 ka. Age errors arising from 
uncertainty in the 234 U half-life (7 7) are negligible 
for all samples. 

15. For example, a 0.2% departure in the Z3t Th/ 238 U 
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Regulation of the Amount of Starch in Plant 
Tissues by ADP Glucose Pyrophosphorylase 

David M. Stark, Kurt P. Timmerman,* Gerard F. Barry, 
Jack Preiss, Ganesh M. Kishoret 

Starch, a major storage metabolite in plants, positively affects the agricultural yield of a 
number of crops. Its biosynthetic reactions use adenosine diphosphate glucose (ADPGIc) 
as a substrate; ADPGIc pyrophosphorylase, the enzyme involved in ADPGIc formation, is 
regulated by ailosteric effectors. Evidence that this plastidial enzyme catalyzes a rate- 
limiting reaction in starch biosynthesis was derived by expression in plants of a gene that 
encodes a regulatory variant of this enzyme. Ailosteric regulation was demonstrated to be 
the major physiological mechanism that controls starch biosynthesis. Thus, plant and 
bacterial systems for starch and glycogen biosynthesis are similar and distinct from yeast 
and mammalian systems, wherein glycogen synthase has been demonstrated to be the 
rate-limiting regulatory step. 



tuations in initial 234 U/ 238 U ratio is proba- 
bly thousands of years), we are unable to 
take advantage of replicate analyses to di- 
minish the 234 U age uncertainties. None- 
theless, 234 U ages are more precise than the 
corresponding 230 Th ages for samples older 
than 450 ka and provide a useful check on 
the validity of the oldest 230 Th ages. The 
agreement between 234 U ages and 230 Th 
ages (Table 1) for the samples that formed 
>350 ka (and especially for those that 
formed >500 ka), independently confirms 
the 230 Th ages, and also shows that the 
growth rate of the vein for its first 100,000 
years was similar to its long-term average 
growth rate. 

Overall, the U-series ages form a re- 
markably self-consistent suite of age deter- 
minations. Because this consistency is both 
internal (from replicate samples) and exter- 
nal (from the stability of the overall age- 
distance trend), it seems highly unlikely 
that the dates have been significantly cor- 
rupted by open-system processes, such as 
uranium gain or loss or alpha-recoil phe- 
nomena. The apparent ideality of the U-Th 
system in the vein material is probably the 
result of continuous submergence in water 
that showed limited secular variation of its 
physical and chemical properties (2). 
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ment was doped with graphite before use, heated 
to running temperature in the mass spectrometer, 
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The a- 1,4 glucans (starch and glycogen) 
are the main storage carbohydrates in prac- 
tically all living systems (J). In several 
crops, starch is a major component of the 
harvest and thus directly has an impact on 
yield. Within the last 10 years, the demand 
for starch has dramatically increased for 
both specialized food and industrial uses 
(2) , primarily as a result of the development 
of high fructose com syrups and bio-etha- 
nol. A number of specialty starches (such 
as amylose and waxy starch) are being 
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increasingly recognized for their superior 
material and nutritional properties as well 
as biodegradability. Understanding the crit- 
ical components of the plant starch biosyn- 
thetic machinery therefore has a major 
impact on agriculture and industry. We 
have used transgenic plants to probe the 
rate-limiting step in starch biosynthesis. 

Starch biosynthesis occurs in the plastids 
of plant cells, involving ADPGIc pyrophos- 
phorylase (E.C. 2.7.7.27), starch synthase 
(E.G. 2.4.1.21), and branching enzyme 
(E.C. 2.4.1.18) (I, 3). In view of its 
sensitivity to ailosteric effectors, ADPGIc 
pyrophosphorylase (ADPGPP) has been 
suggested to play a pivotal role in plant 
starch biosynthesis, as it is in the bacterial 
pathway for glycogen biosynthesis. 

The Escherichia coli ADPGPP, encoded, 
by the gigC gene (4), is a regulated ho- 
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motetrameric enzyme that is activated by 
fructose 1,6-bisphosphate (FbP) (5). FbP 
increases the catalytic activity (V^), re- 
duces the Michaelis constant (K m ) for the 
substrates, and decreases the sensitivity of 
the enzyme to the inhibitors adenosine 
monophosphate (AMP) and inorganic phos- 
phate (P^ (5, 6). A mutant E. coli K12 
strain, 618, accumulates approximately 33% 
higher quantities of glycogen than its wild- 
type parent as a result of an alteration in the 
regulatory properties of ADPGPP (7). This 
mutant enzyme is less dependent on the 
activator, FbP, and is less sensitive to inhi- 
bition by the inhibitor AMP. The mutant 
gfeCI6 gene encodes a protein with a substi- 
tution of aspartic acid for glycine at position 
336 in the ADPGPP enzyme (8, 9). 

Plant ADPGPPs are tetramers that con- 
tain two distinct subunits and are regulated 
by 3-phosphoglyceric acid (PGA) and ? i as 
positive and negative effectors, respectively 
(J). Although both large and small subunits 
(JO— 13) show homology to the E. coli 
ADPGPP enzyme, the individual subunits 
do not appear to be catalytically active as 
shown by the absence of significant enzyme 
activity of the maize and Arabidopsis thahana 
starch-deficient (ADPGPP) mutants (J, 
14). In view of the difficulties involved in 
coordinating the expression of two distinct 
genes, we used the E. colt glgP gene to 
probe the regulation of metabolic flux by 
ADPGPP. Furthermore, to minimize inter- 
ference by the complex allosteric regulation 
of the wild-type E. coli gene, we used the 
mutant glgCl6 gene. 

Because starch biosynthesis occurs in 
plastids, we targeted the glgC16 gene prod- 
uct to plastids using a modified chloroplast 
transit peptide (CTP) derived from an Ar- 
abidopsis small subunit ribulose 1,5-bisphos- 
phate carboxylase (rbcS) gene (15, J 6). 
Uptake and processing of the fusion protein 
(CTP-ADPGPP) by chloroplasts was con- 
firmed by incubation of radiolabeled fusion 
protein with chloroplast preparations from 
lettuce leaves (17) (Fig. 1). 

Because the E. coli enzyme is a homotet- 
ramer and its interaction with plant 
ADPGPP subunits is not known, we deter- 
mined if the CTP-gfeC16 gene product is 
enzymatically active in plant cells using a 
transient expression system (J 8). The chi- 
meric CTP-gtgCI6 gene was introduced by 
electroporation into tobacco protoplasts 
under control of the cauliflower mosaic 
virus (CaMV)-enhanced 35S (e35S) pro- 
moter (J 9) and the polyadenylation signal 
derived from nopaline synthase (NOS) en- 
coded by the nopaline synthase gene (Nos) 
(20) (Fig. 2); the activity of the gene prod- 
uct was assayed. Extracts of protoplasts elec- 
troporated with the chimeric gene displayed 
P r resistant ADPGPP activity, whereas the 
controls had little or no activity under these 
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conditions (Fig. 3). Thus, the E. coli mutant 
gene could be expressed in plant cells, and 
the gene product retains its enzymatic activ- 
ity and phosphate insensitivity. 

To produce stable transgenic plants that 
express the CTP-gfeC16 gene, we mated 
the pMON20104 plasmid containing a 
NOS-NPTII (neomycin phosphotransfer- 
ase) gene as the selectable marker into 
Agrobacterium tumefaciens strain ASE and 
subsequently used it for the transformation 
of tobacco (2 J), tomato (22), and potato 
(23). Calli and shoots resulting from trans- 
formation and displaying resistance to kana- 
mycin were analyzed for expression of the 
CTP-g!gCI6 gene and starch content. The 
examination of transgenic and control to- 
bacco calli by light microscopy revealed 



Fig. 1. Import and processing of the CTP- 
ADPGPP fusion protein by chloroplast prep- 
arations of lettuce (Lactuca sativa var. longifo- 
lia). Molecular size markers are indicated to the 
left in kilodaltons. The CTP-glgC16 gene in 
pMON20100 was amplified by PCR with the 
use of an SP6 promoter primer (Promega, Mad- 
ison, Wisconsin) and the primer used for intro- 
ducing the Sac I site at the 3' end of the glgC16 
gene. The PCR product was purified by aga- 
rose gel electrophoresis and binding to a DEAE 



substantial differences in the number of 
starch granules (Fig. 4). Protein immuno- 
blot and quantitative starch analyses on 12 
independent transgenic and 2 control lines 
showed that calli that expressed the CTP- 
g!gCJ6 gene contained on average 10.7% 
starch (SD =^6.2, on the basis of dry 
weight), a significant increase (P = 0.0017, 
Welch analysis of variance (ANOVA) t 
test for unequal variance) over controls that 
contained 3.4% starch (SD = 0.5). One of 
the samples expressing the CTP-gIgCJ6 
gene contained 26.9% starch (Fig. 5). 
These observations establish that the 
ADPGPP reaction is the rate-limiting step 
in starch biosynthesis in tobacco cells. Even 
in tomato leaves, which typically do not 
accumulate large amounts of starch, a cor- 
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membrane The purified DNA was incubated with SP6 RNA polymerase (Promega). The resulting 
RNA was translated in rabbit reticulocyte lysate (Promega) containing [ 35 S]methionine, the 
19-amino acid mixture, RNA, and RNasin (Promega). SDS-gel electrophoresis of the resulting 
translation product demonstrated that at least two major radiolabeled proteins were produced. The 
slowest migrating band had a molecular size expected for the CTP-ADPGPP fusion protein (59 kD). 
The additional smaller bands arise from internal initiation of translation. The radiolabeled translation 
product was incubated for 1 to 10 min with intact chloroplast preparations from lettuce, obtained as 
described (17) The medium and chloroplasts in the incubation mix were separated by silicone-ort 
gradient centrifugation, and the chloroplast fraction was treated with trypsin. Both fractions were 
then treated with SDS buffer for electrophoresis. SDS-polyacrylamide gel electrophoresis was 
conducted with the use of a 3 to 17% polyacrylamide gradient. The get was fixed, soaked in 
EN3HANCE (NEN Boston, Massachusetts), dried, and imaged by autoradiography with the use of 
an intensifying screen and overnight exposure at -70°C. The 1-, 5-, and 10-min radiolabel bands 
in the chloroplast fractions were trypsin-resistant, which indicates that they were present within the 
organelle. 

Fig. 2. A schematic representation of the chi- 
meric 35S promoter-CTP-g/gC 16-Nos gene 
and the transferred DNA vector (pMON20104). 
To express the CTP-glgC16 gene in both tran- 
sient as well as stably transformed plant cells, 
we created a Bgl II site 7 bp upstream of the 
ATG translation initiation codon of the CTP in 
pMON20100. Xba I and Sac I sites were intro- 
duced after the termination codon in glgC16 \n 
the same construct. These mutations were in- 
troduced by PCR mutagenesis with the use of 
appropriate primers. The PCR product was 
digested with Bgl II and Sac I. The vector, 
pMON999, which contains the e35S promoter 
and the Nos 3' potyadenylate signal, was di- 
gested with Bgl II and Sac I. The product of the 
PCR reaction was iigated with the vector, and 
the result was transformed into £ coli MM294. 
The resulting plasmid, pMON20102, contained - ' - 

a 2 5-kb Not I fragment that consisted of the e35S promoter, CTP-glgC16 t and the Nos terminator. 
The Not I cassette was transferred to the plant transformation vector pMON530 (34), and the 
resulting plasmid was designated pMON20104. The spectinomycin resistance gene is indicated by 
specR;,< - - ' " r v ' " : --"^ ■' ■ ^ ' * " ' 
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relation was observed between the presence 
of the bacterial ADPGPP and starch con- 
tent (Fig. 4C). 

Our attempts to regenerate plants that 

^ expressed the .CTP-gIgCI6 gene under the 

^ control of the e35S promoter resulted in 
poor recovery of plants expressing the 
gene. Our interpretation of these observa- 
tions is that constitutive expression of the 
CTP-gJgC 16 gene, as directed by the 
CaMV 35S promoter, is detrimental to 

^ plant growth and development. Excess 

^ starch accumulation in the leaves may 
reduce sucrose availability for export and 
thus restrict the amount of carbon avail- 
able to the actively growing portions of 
the plant. This decrease in available su- 
crose also may affect the turgor of a num- 

^ ber of different plant cells and tissues. In 
accord with these observations, a trans- 
genic potato plant that expressed the 
e35S/CTP-glgC16/Nos gene was recovered 
that could be maintained on a sucrose- 
containing medium but could not survive 
in soil when sucrose was not provided in 

f the growth medium. 

To overcome the detrimental effects of 
constitutive ADPGPP activity, we intro- 
duced the CTP-glgCl6 gene into potato 
plants under the control of a tuber-specific 
patatin promoter (24). We recovered nu- 
merous transgenic plants (25) with a 

£ transformation efficiency typical for pota- 
to, and plants that expressed this gene had 
a normal phenotype. Specific gravity mea- 
surements, which indicate tuber density 
and thus starch content (26), show that 
tubers expressing the patatin-CTP-gIgC16 



gene contain, on average, 35% more 
starch than control tubers (Fig. 6 and 
Table 1). Some transgenic potato lines 
produced tubers with nearly 60% more 
starch than controls. This range of specific 
gravity values in control and transgenic 
lines is typical of potato and has been 
documented (27). Chemical determina- 
tion of starch content from a sampling of 
tubers confirmed that specific gravity was a 
valid reflection of starch content (Table 
2). Therefore, tissue-specific expression of 
the CT?-glgCl6 gene with the use of a 
patatin promoter results in significant in- 
creases in starch content without adverse 
effects on plant growth or development. 
Recently, similar results were obtained in a 
small-scale field trial (25), which indicates 
that ADPGPP activity is rate-limiting for 
tuber starch biosynthesis even under condi- 
tions of potato cultivation. 

Whereas the presence of the CTP- 
glgCl6 gene product was absolutely re- 
quired for increased starch production, the 
extent of starch content increase was not 
absolutely correlated with the level of 
expression of E. coli ADPGPP enzyme 
protein in all the tubers (Table 3). This 
suggests that expression of small amounts 
of the CTP-g/gCJ6 gene may be sufficient 
to overcome the limitation of ADPGlc 
availability for starch biosynthesis and to 
create a new rate-limiting step. Alterna- 
tively, substrate availability (glucose 
1 -phosphate) for the ADPGPP reaction 
may itself become rate limiting. Which of 
these mechanisms is operative under phys- 
iological conditions can now be investi- 
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gated using these transgenic potato plants. 

In vitro studies that use CommeHna 
guard-cell chloroplast extracts have shown 
that the activity of starch synthase is only 
one-twentieth that of ADPGPP and the 
branching enzymes (J, 28). These results- 
imply that starch synthase is the rate-limit- 
ing reaction in starch biosynthesis. In view 
of our finding that enhanced ADPGPP 
activity increases the starch content of 
plant cells, we reasoned that allosteric reg- 
ulation may down-regulate ADPGPP activ- 
ity under in vivo conditions. To evaluate 
the importance of allosteric regulation in 
the control of ADPGPP activity and starch 
content, we fused the wild-type E. coli glgC 




Fig. 4. (A) Polarizing photomicrograph of a 
cross section of tobacco callus tissue trans- 
formed with vector control DNA. Leaf disks of 
tobacco were transformed (21) with Agrobac- 
terium strain ASE containing the pMON530 
vector. After 2 to 3 weeks, calli formed, and 
the individual clumps were separated from the 
leaf disks. The starch granules appear as 
white, birefringent structures, (B) Polarizing 
photomicrograph of a cross section of tobac- 
co callus tissue transformed as in (A) with 
pMON20104. (C) Iodine-stained transgenic 
tomato shoot expressing the CTP-glgC16 
gene (right) and a tomato shoot not containing 
the CTP-glgCl6gene. pMON16927 was con- 
structed by ligation of the ClP-glgC1 6 chimer- 
ic gene from pMON20102 as a Bgl ll-Sac I 
fragment into the binary vector pMON977 
(34). Cotyledons of Lycopersicon esculentum 
cv. UC82B were transformed (22) by Agro- 
bacterium strain ABI containing pMON977 for 
control and pMON16927 for CJP-glgC 1 6 ex- 
pression. After 2 to 3 weeks, shoots were 
cleanly excised from the callus, decolored in 
70% ethanol for 1 hour, and stained in a 
solution of 0.2% l 2 and 0.4% Kl. 
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Fig. 3. ADPGPP activity in protoplasts. Plasmid 
pMON20104 was subject to electroporation 
into tobacco TXD protoplasts (18). Plasmid 
pMON999 was used as the control. The proto- 
plasts were separated from the medium by 
centrifugation and washed with 100 mM tris 
EDTA buffer (pH 7.5) that contained 35 mM KCI 
1 and 20% glycerol. The pellet was suspended in 
the extraction buffer [200 p.1 of washing buffer 
that contained 5 mM dithiothreitol (DTT), 1 mM 
benzamidine, and 5 mM sodium ascorbate per 
50 \i\ of protoplast volume] and sonicated for 2 

min. The resulting suspension was centrifuged to remove the insoluble pellet, and the supernatant 
was desalted on a Sephadex G-50 (Sigma) spin column equilibrated with extraction buffer. The 

( enzyme assay mix contained in 100 p.l: 10 jimol Hepes (pH 7.7), 50 \x.g of bovine serum albumin, 
0.05 M,mol 14 C-glucose 1-phosphate, 0.15 u.mol of adenosine 5 '-triphosphate, 0.5 jimol of MgCI 2 , 
0.1 u,g of crystalline yeast inorganic pyrophosphatase, 1 mM ammonium molybdate, and water. 
Fructose 1 ,6-bisphosphate (FbP) (2.5 mM) or 3-phosphoglycerate (20 mM) was added as indicated 
to activate the E. coli or plant ADPGPP enzymes, respectively. To further distinguish endogenous 
ADPGPP activity from that of the £ coli ADPGPP mutant, we added 10 mM inorganic phosphate 
(KPJ, which completely inhibits the plant enzyme but does not affect the E. coli enzyme. The 
enzymatic reaction was carried out at 37°C for 10 min and stopped by incubation in a boiling water 

I bath for 1 min. After centrifugation to remove the denatured protein, 40 ^l of supernatant was loaded 
on a SynChropak AX1 00 (SynChrom, Lafayette, Indiana) column (250 x 4.6 mm) and eluted with 65 ' 
mM KPj (pH ,5.5) at a flow rate of 1 ml per minute. The radioactivity of the eluate was followed with 
the use of a Radiomatic (Radiomatic Instruments, Tampa, Florida) detector. Glucose 1-phosphate 
eluted at 8.3 min, and ADPGlc eluted at 14 min under these conditions. The radioactivity in the 
ADPGlc peak was a measure of the ADPGPP activity of the sample. Protein content of the extract - 
was measured by the Bradford method (35) with the use of the Bio-Rad reagent.. ^ ^ t 
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gene with the CTP and expressed it in 
potato tubers and in tomato leaves using 
the patatin and Arabidopsis rfccS promoters, 
respectively. Unlike the GlgC16 enzyme, 
wild-type GlgC is fully subject to allosteric 
regulation (4). However, the primary effec- 
tor molecules differ between the plant and 
bacterial enzymes. Thus, depending on the 
relative concentrations of these effectors in 
vivo, expression of the CTP-glgC gene may 
also result in increased ADPGPP activity 
and starch content in a plant cell environ- 
ment. Expression, enzyme activity, and 
plastid targeting from the cloned glgp gene 
were confirmed before plant transformation 
(29). A large number of tomato plants 
transformed with the rbcS-CTP-glgC gene 
were obtained and were phenotypically nor- 
mal. Analyses of leaf tissue showed high 
levels (0.1% of total protein) of expression 
of the CTP-glgC gene but only a slight 
effect on starch content (Table 4). The 
increase in average starch content of 1 1.2% 
was not significant (t test, P = 0.3111). By 
contrast, recovery of plants with leaf ex- 
pression of CTP-glgCJ6 was very poor, sim- 
ilar to the lethal effect seen with the e35S 
promoter. 

In potato tubers, expression of CTP-glgC 
did not result in a noticeable increase in 
starch content (Table 5), even though ex- 
pression levels were equivalent to those in 
tubers that expressed CTP-gfeCI6. We con- 
clude that CTP-gigC expression may lead to 
a slight, but not significant, amount of 
starch increase in plant tissues. The kinetic 
characteristics of the GlgC and GlgC 16 
enzymes have been reported (7-9) and dif- 
fer primarily in allosteric regulation. The 
Vmax of each enzyme is the same under fully 
activated conditions. On the basis of these 
features, our work represents further evi- 
dence that ADPGPP activity is rate limit- 
ing in starch biosynthesis and that it is the 



Table 1. Average specific gravity and starch 
content is increased in tubers that express the 
CTP-c;/gC76gene. The number of independent 
plant lines tested is indicated, with two or three 
tubers per plant weighed. Specific gravity was 
determined by the weight-in-air-weight-in-wa- 
ter method (26). Sample standard deviations 
(SD) of the specific gravity measurements are 
indicated. Percent starch was calculated from 
the average specific gravity as described (26). 
Controls consist of a combination of Russet 
Burbank tubers transformed to contain only the 
vector DNA and tubers from the glgC16 trans- 
formation event that do not express E. coti 
ADPGPP. . . 
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regulatory properties of ADPGPP, not the 
amount of enzyme protein, that make it 
rate limiting. 

We investigated the effect of cytosol-tar- 
geted expression of GlgC 16 by localizing the 
gjgCJ6 gene product within the cytosol of 



Fig. 5. Starch content (A) and protein immuno- 
blot analysis (B) of tobacco calli. Samples 1 
through 6 express the gigC16 gene, sample 13 
is a control lacking the glgC16 gene, and the 
last two lanes are sample 13 with either 2 or 10 
ng of GlgC16 protein purified from £ cofi. 
Starch levels in transformed tobacco calli were 
quantitated with the method of Lin et a/. (14). 
Differences are significant with the use of the 
Welch AIMOVA t test for unequal variance (P = 
0.0017). For protein immunoblot analysis, a 
portion of the dried, homogeneous callus from 
each of the experimental and control samples 
was suspended in 200 \t\ of extraction buffer 
[100 mM tris-CI (pH 7.1), 1 mM EDTA, 10% 
glycerol, 5 mM DTT, and 1 mM benzamidine]. 
We ground each sample to extract the protein 
and centrifuged the samples to remove insolu- 
ble debris. The protein concentration of the 
supernatant was estimated with the use of 
Bradford reagent, and 25 \lq of protein from 



potato tuber cells. Amounts of gfeCJ6 gene 
product were slightly smaller than amounts of 
the CTP-gfeCI6 gene product (0.02 to 0.05% 
versus 0.02 to 0. 1% of total tuber protein) but 
were within the range of expression seen in 
high-starch CTP-gJgC 16 tubers. Cytosolic lo- 




each sample was loaded on SDS-polyacrylamide gels, with a 7 to 17% gradient. After electropho- 
resis, the gels were blotted onto nitrocellulose, and the £ coli ADPGPP was visualized with the use 
of rabbit anti-ADPGPP as the primary antibody and goat anti-rabbit serum conjugated to alkaline 
phosphatase as the secondary antibody. The reaction of alkaline phosphatase with nitro blue 
tetrazolium and 5-bromo-4-chloro-3-indoyl phosphate was used for color development. 



Fig. 6. Specific gravity of potato tubers express- 
ing CTP-glgC16. We generated pMON201 1 3 by 
replacing the CaMV 35S promoter in pMON999 
with a 1-kb Hind Ill-Bam HI fragment that con- 
tained a portion of a class I patatin promoter 
(24). The gtgC16 gene from pMON20104, in- 
cluding the chloroplast transit peptide derived 
from the ribulose 1 ,5-biphosphate carboxylase 
small subunit 1 A gene of Arabidopsis, was then 
added as a Bgl ll-Sac I fragment, and the entire 
patatin promoter-CTP-g/gCt6-NOS 3' end cas- 
sette was cloned as a Not I fragment to the 
binary vector pMON886 (34). Potato var. Russet 
Burbank was transformed with Agrobacterium 
strain ASE containing the pMON20113 vector 
(23). Control transformations were performed 
with the use of the vector pMON886. Tubers 

from regenerated plants were screened for the expression of GlgC16 by protein immunoblot (Fig. 5). 
Fifteen CTP-g/gC76-positive and 21 control plants were screened, and specific gravity was 
determined for two to three tubers per plant. Solid and hatched bars represent the numbers of tubers 
for those expressing £ co// ADPGPP and for control tubers, respectively, that fell within the indicated 
range of values. Controls consisted of a combination of tubers transformed to contain only the vector 
DNA and tubers from the glgC1 6 transformation event that do not express £ coli ADPGPP. 

Table 2. Average values for percent starch determined experimentally by enzymatic degradation 
(14) compared to starch content calculated from specific gravity measurements. Sample standard 
deviations are in parentheses. Differences between the methods for starch determination are not 
significant However, differences between CTP-GlgC16+ and controls, calculated by specific gravity 
or enzymatic methods, are significant by the t test (P < 0.0001). Tuber types are as in Table 1 ... 




Range, specific gravity 



Tuber type 



Sample size 



Average % starch 
specific gravity 



Average % starch 
enzymatic 



CTP-GlgC16 * 15 
Control ' " 21 



1,088 0.012 
1.068 0.010 



15.4 
11.4 



CTP-GlgC16 
Control 



11 
11' 



16.3 (1.47) 
11.9 (1.56) 



16.0 (2.00) 
12.3 (1.15) 
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Table 3. Comparison of expression levels of CTP-gfgC16 and their effects on starch content. 
CTP-glgC16 levels were estimated from protein immunoblot analysis by comparison to known 
standards. Starch content was determined by the enzymatic degradation method. One tuber 
[Russet Burbank (R.B.)] from each line was used for both analyses. Control starch levels represent 
the average value for 12 tubers. 



Tuber type 


Range 
(ng of CTP-glgC16 
per 50 \lq of protein) 


Range 
(% starch, 
fresh weight) 


R.B. control 


0 


5.4 to 14.4 


R.B. + high GlgC16 


26 to 50 


14.4 to 19.2 


R.B. + medium GlgC16 


10 to 25 


11.8 to 19.1 


R.B. + lowGlgC16 


0.5 to 10 


8.8 to 17.4 



Table 4. Starch content of tomato leaves ex- 
pressing CTP-glgC. The wild-type CTP-glgC 
gene was engineered for expression in plants 
essentially as described for the CTP-glgC16 
gene (16) (Figs. 2 and 6). The CTP-a;/gC gene 
was added to a derivative of the binary vector 
pMON977 (34), which contains the Arabidop- 
sis promoter from the rt>cS gene ( 15) in place 
of e35S, resulting in the plasmid pMON16938. 
Lycopersicon esculentumcv. UC82B (L.e.) was 
transformed (22) with Agrobacterium strain ABI 
containing the vector pMON 16938. The subse- 
quent R1 generation was screened for the 
expression of CTP-GlgC in leaves by protein 
immunoblot. Standard deviations (SD) are indi- 
cated. Starch content was determined (14) in 
leaf tissue taken from plants at the end of the 
light cycle. Values are not significantly different 
(t test, P= 0.3111). 



Leaf type 


Plants 
(") 


Average % 

starch 
(fresh weight) 


SD 


La + CTP-GlgC 
L.e. control 


14 
10 


9.3 
8.4 


1.8 
2.7 



calization of the gJgC16 gene product, howev- 
er, did not result in a significant effect on 
starch content (Table 6). 

Recently, there has been controversy 
concerning the subcellular location of 
ADPGlc synthesis in plant cells. Because 
under in vitro conditions cytosol-localized 
sucrose synthase uses ADP as a substrate 
and plastids import ADPGlc, it has been 
suggested that sucrose synthase serves as a 
source of ADPGlc for starch biosynthesis in 
plant cells and tissues (30). However, in 
view of our demonstration that only the 
expression of CTP-glgC 16, not g!gCJ6, re- 



sults in an increased starch content, it is 
unlikely that this mechanism contributes 
significantly to starch biosynthesis. Further 
evidence in support of the plastidial 
ADPGPP pathway comes from recent work 
that shows that reduction of the plant 
ADPGPP enzyme level by means of an- 
tisense RNA results in decreased starch 
formation (31). 

Because potato tubers that express the 
CTP-glgC 16 gene contain more starch, it 
would be interesting to determine if this 
is a result of the ability of the potato plant 
to provide additional photosynthate to the 
tuber or a result of a reduction in water 
content of the tuber due to rapid conver- 
sion of the osmotically active sugars to 
starch. Evidence exists that plants contain 
unused photosynthetic capacity that could 
become available should the demand be 
present (32). This is supported by numer- 
ous reports that show that increased car- 
bon demand results in increased assimila- 
tion in and output from the leaves 
(33). 

ADPGPP has been suggested to influ- 
ence the ability of tubers to import carbon 
and convert it into starch — that is, the 
rate of starch accumulation regulates the 
rate of carbon import (I); this hypothesis 
is now supported by our study. The trans- 
genic potato plants described here will 
allow the testing of models that suggest 
that photosynthetic carbon assimilation is 
not limiting to yield. It is interesting that 
a single enzymatic step regulates the end- 
product levels to such a dramatic extent in 
a complex multicellular organism. With a 
similar approach, it should be possible for 



Table 6. Specific gravity and starch content of 
potato tubers that express GlgC16 in the cyto- 
sol. A cytosolic version of the glgCW gene was 
created by ligation of the coding region from 
pMON20100 as a Nco I and Sac I fragment into 
a version of pBluescript kS+ (Stratagene, La 
Jolla, California) engineered to contain Bgl II 
and Nco I sites in the polylinker. The coding 
region was then ligated as a Bgl II and Sac I 
fragment into the binary vector pMON16952, 
which contains the class I patatin promoter, 
resulting in the plasmid pMON 16971. Potato 
var. Russet Burbank was transformed with 
Agrobacterium strain ABI containing the 
pMON16971 vector (23). Tubers formed from 
regenerated plants were analyzed as de- 
scribed (Table 1). 



Tuber 
type 


Plant 
lines 


' Average 
specific 
gravity 


SD 


Average 

% 
starch 


GlgC16+ 
Controls 


26 
22 


1.073 
1.073 


0.010 
0.008 


12.4 
12.4 



researchers to manipulate the amounts of a 
number of key metabolites such as lipids, 
amino acids, and carbohydrates by affect- 
ing or altering a major regulatory step in 
the biosynthesis of relevant metabolites. 
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Intercellular Propagation of Calcium Waves 
Mediated by Inositol Trisphosphate 

Scott Boitano, Ellen R. Dirksen, Michael J. Sanderson* 

Two types of calcium (Ca 2+ ) signaling— propagating intercellular Ca 2+ waves of in- 
creasing intracellular Ca 2+ concentration ([Ca 2+ ],) and nonpropagating oscillations in 
[Ca 2+ ]j— co-exist in a variety of cell types. To investigate this difference in Ca 2+ sig- 
naling, airway epithelial cells were loaded with heparin, an inositol 1 ,4,5-trisphosphate 
(IP 3 ) receptor antagonist, by pulsed, high-frequency electroporation. Heparin inhibited 
propagation of intercellular Ca 2+ waves but not oscillations of [Ca 2+ ],. In heparin-free 
cells, Ca 2+ waves propagated through cells displaying [Ca 2+ ]j oscillations. Depletion of 
intracellular Ca 2+ pools with the Ca 2+ -pump inhibitor thapsigargin also inhibited the 
propagation of Ca 2+ waves. These studies demonstrate that the release of Ca 2+ by IP 3 
is necessary for the propagation of intercellular Ca 2+ waves and suggest that IP 3 moves 
through gap junctions to communicate intercellular Ca 2+ waves. 



Intercellular communication is essential for 
the function of multicellular systems, but 
the nature of the signal or signals that pass 
between cells through gap junctions is not 
fully established. Both Ca 2+ and IP 3 have 
been proposed as intercellular messengers 
(J -3). Nonexcitable cells often respond to 
agonists by increasing their [Ca 2+ ]j in an 
oscillatory manner (4), but these oscilla- 
tions in [Ca 2+ ] i occur independently of 
[Ca 2+ ]i changes in adjacent cells (3). In 
contrast to Ca z+ oscillations, a propagating 
intercellular wave of increased [Ca 2+ ] i (a 
Ca 2+ wave) can be initiated by mechanical 
stimulation of a, single cell in cultures of 
airway epithelial (2), rat brain glial (3, 5, 

Department of Anatomy and Cell Biology, UCLA 
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*To whom correspondence should be addressed. 
292 



6) , or bovine aortic endothelial (7) cells or 
by treatment of astrocytes with glutamate 
(8). In airway epithelial cells Ca 2+ waves 
are blocked by the gap junction inhibitor 
Ca 2+ halothane (2), and in C6 glioma cells 
only cells transfected with and expressing 
the gene for the gap junction protein con- 
nexin43 propagate Ca 2+ waves (6). These 
results indicate that Ca 2+ waves are propa- 
gated through gap junctions (1-3, 5-3). A 
role for IP 3 in the communication of Ca 2+ 
waves has been proposed because Ca 2+ 
waves are propagated in the absence of 
extracellular Ca 2+ (2, 3, 7), are propagated 
when Ca 2+ -induced Ca 2 t release is inhib-- 
ited (5), and can be initiated by microin- 
jection of IP 3 (2). If IP 3 acts as the inter- 
cellular messenger for propagation of Ca 2+ 
waves, intracellular heparin, an antagonist 
of the IP 3 receptor (9), should block or 
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attenuate the Ca 2+ wave (9). 

Traditional loading techniques, such as 
microinjection, are not well suited for load- 
ing the large numbers of cells required for 
investigation of this multicellular response. 
Therefore, we used pulsed high-frequency 
electroporation (PHFE) (10, II) to load 
cultured cells with heparin (Fig. 1). Because 
heparin cannot be detected by fluorescence 
microscopy, cells were simultaneously load- 
ed with fluorescent Texas red-conjugated 
dextran (TRD) to identify cells that incor- 
porated heparin (Fig. 1) (II, 12). PHFE has 
a major advantage over current-discharge 
electroporation in that most cells survive 
(JO). After PHFE, and loading cells with 
fura-2 by incubation in fura-2-pentaacetoxy- 
methyl ester (fura-2- AM) (12), more than 
90% of the heparin-loaded cells retained 
fura-2 and heparin-loaded ciliated cells con- 
tinued to display ciliary activity. These re- 
sults demonstrate the potential of PHFE for 
loading cells with molecules that are imper- 
meable to the cell membrane because of 
their large molecular size or ionic charge. 

In an area where all cells were loaded 
with heparin, mechanical stimulation of a 
single cell increased [Ca 2+ ) i in the stimu- 
lated cell but did not initiate propagation of 
a Ca 2+ wave through multiple adjacent 
cells, even though the increase in [Ca 2+ ] i of 
the stimulated cell ranged from 300 nM to 
>1 \xM (Fig. 2A). In a few of these 
experiments in the heparin-loaded area, 
[Cal + ]i increased in single cells directly 
adjoining the stimulated cell, but an in- 
* crease of [Ca 2+ ] i in more distal cells was not 
observed. The increase in [Ca 2 " 1 "^ of the 
stimulated, heparin-loaded cells, and our 
reports that the [Ca 2+ ] i of a mechanically 
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Cloning of the yeast FAS3 gene and primary structure of yeast 
acetyl-CoA carboxylase 

(acetyl-CoA carboxylase gene/biotin carboxylase/transcarboxylase) 
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ABSTRACT We have isolated and determined the nucle- 
otide sequence of the yeast FAS3 gene, which encodes acetyl- 
CoA carboxylase (EC 6.4.1.2). The sequence has an open 
reading frame of 6711 bases coding for a protein of 2237 ammo 
acids with a calculated molecular weight of 250,593. The 
presence of the unique biotin-binding site, Met-Lys-Met, and 
the known CNBr peptide and COOH-terminal sequences con- 
firmed the nudeotide-derived amino acid sequence. The yeast, 
chicken, and rat carboxylases have an overall sequence identity 
of 34%, suggesting that the eukaryotic carboxylase evolved 
from a single ancestral gene. The amino acid sequences of yeast 
fatty acid synthase subunits are least homologous with the 
animal synthase sequences, whereas carboxylase sequences are 
highly conserved. The sequences of the ATP, HC0 3 , and CoA 
binding sites of the carboxylases are also well conserved (=50% 
identical). The sequences surrounding the biotin binding site 
are poorly conserved, suggesting that this sequence may not be 
critical as long as the biotin is available for carboxylase 
reactions. On the basis of this sequence identity, we have 
defined the putative biotin carboxylase and transcarboxylase 
domains. 

Acetyl-CoA carboxylase (ACC; EC 6.4.1.2) catalyzes the 
committed step in fatty acid biosynthesis, yielding malonyl- 
CoA, the donor of the two-carbon units for the synthesis of 
long-chain fatty acids (1). In prokaryotes, the enzyme con- 
sists of three readily dissociated proteins, the biotin carboxyl 
carrier protein (BCCP), the biotin carboxylase, and the 
transcarboxylase (2), whereas in higher and lower eukaryotic 
cells, these proteins are part of a single multifunctional 
polypeptide derived from the expression of a single gene that, 
presumably, evolved by the fusion of individual genes. Re- 
cently, the cDN As coding for the rat (3) and chicken (4) ACC 
were cloned and sequenced. The sites for the biotin attach- 
ments in both carboxylases, which are conserved as in all 
other biotin-containing enzymes, were readily identified. 
Putative domains for the two catalytic functions were as- 
signed (3, 4) and sites for phosphorylation were located (5). 

ACC has been isolated from Saccharomyces cerevisiae (6) 
and Candida lipolytica (7) and shown to contain single- 
subunit proteins of molecular weights 189,000 and 230,000, 
respectively. To understand the relationship between struc- 
ture and function of the yeast ACC and to utilize the potential 
of genetic manipulations in yeast, we have undertaken a 
systematic analysis of the enzyme. In this report, we describe 
the isolation and nucleotide sequence of the FAS3 gene,* 
which encodes yeast ACC, and compare the deduced amino 
acid sequence with sequences of the rat and chicken enzymes 
to determine their evolutionary relationship. 



MATERIALS AND METHODS 

Preparation of ACC and Its Antibodies. Yeast ACC from 
extracts of baker's yeast was purified to a state of homoge- 
neity. Cell-free extracts were prepared, as described previ- 
ously (8), and the enzyme was isolated by ammonium sulfate 
fractionation (0-28% saturation), polyethylene glycol 8000 
(0-6%) precipitation, and avidin-Sepharose affinity chroma- 
tography (9). The carboxylase preparation had specific ac- 
tivity of 2.5 units/mg protein when assayed by [ C]bicar- 
bonate incorporation into malonyl-CoA, as described (9). 
Antibodies against the purified ACC were raised in rabbits, 
and affinity-purified anti-ACC antibodies were prepared (10). 

Isolation and Sequencing of the FAS3 Gene. The yeast 
genomic DNA libraries in Agtll and EMBL3a vectors were 
provided by M. Snyder (Yale University) (11). The Agtll 
library was screened with anti-ACC antibodies by following 
standard procedures (11, 12), and the EMBL3a library was 
screened with radioactive DNA probes as described (13). The 
yeast strain SEY2102 was grown in appropriate media and 
total RNA was isolated for Northern analysis (14). 

DNA sequencing was performed by using both the dideoxy- 
nucleotide termination method (15), as described previously 
(14), and an automated DNA sequencer 'Applied Biosystems 
model 370A), according to manufacturer's recommenda- 
tions. All the restriction enzymes and other chemicals were 
purchased from commercial and standard sources (9, 14). 

RESULTS AND DISCUSSION 

Isolation and Expression of Agtl lace. The native yeast ACC 
is a tetramer of identical subunits each having an estimated 
molecular weight of 250,000 (Fig. 1). Affinity-purified anti- 
ACC antibodies were used to screen a yeast Agtll genomic 
DNA expression library (11, 12). A positive clone, Agtllacc. 
was isolated and shown to contain a 3.0-kilobase-pair (kbp) 
fragment of the putative ACC genomic DNA. Initial verifi- 
cation came from DNA sequence analysis of an Sst l-Sst I 
fragment from Agtllacc (see Fig. 3). The nucleotide-denved 
amino acid sequence showed a high degree of sequence 
identity with animal ACC DNA sequences, and, on the basis 
of this homology, it was concluded that the Agtllacc clone 
contained the portion of the FAS3 gene coding for the COOH 
terminus of yeast ACC. The identity of this clone was venfiec 
further by immunoblotting (16) of protein lysates fron 
Agtllacc/ Y1089 lysogens (10). As shown in Fig. 1, a fusion 
protein (Af r - 180,000) produced by the Agtllacc recombi- 
nant phage reacted with the anti-ACC antibodies. On the 
basis of the sizes of the fusion protein and the /3-galacto- 
sidase, the carboxylase gene fragment in Agtllacc coded tor 
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Fig. 1. SDS/PAGE and Western 
blot analyses of yeast ACC and the 
/3-galactosidase-fused protein pro- 
duced by Agtllacc. (A) Coomassie 
blue-stained 5% polyacrylamide de- 
naturing gel. Lane 1, protein molec- 
ular weight standards (myosin, 
200,000; 0-galactosidase, 116,000; 
and phosphorylase b, 97,000); lane 2, 
yeast ACC (5 /ig); and lane 3, ceil 
lysate obtained from Agtllacc lyso- 
gen. {B) Western blot analysis of a 
similar gel with anti-yeast ACC anti- 
bodies. Lane 4, yeast ACC; and lane 
5, Agtllacc cell lysate. 



a protein of about M T 66,000. However, the sizes of the 
expressed protein and the cloned genomic DNA fragment 
were smaller than the carboxylase subunit protein of A/ r 
250,000 and the expected size of about 7.0 kbp of the ACC 
gene, respectively. To determine if Agtllacc hybridizes to a 
high molecular weight RNA, Northern analysis (17) was 
performed using total yeast RNA (13) in conjunction with a 
32 P-labeled £coRI fragment from Agtllacc. As shown in Fig. 
2, the 0.6-kbp yeast DNA fragment isolated from Agtllacc 
hybridized to an mRNA of 7.5 kilobases (kb), which is larger 
than 6.6-kb FAS1 mRNA (14) and is consistent with the sizes 
of yeast ACC and /3 subunit of yeast fatty acid synthase. 

A 1.9-kbp Kpn I restriction fragment from Agtllacc (Fig. 3) 
was used as a probe for screening an EMBL3a yeast genomic 
library according to standard procedures (13). A clone, 
EMBL-ACC, was isolated and shown to contain a yeast DNA 
insert of about 14 kbp. Southern analysis (data not shown) 
indicated that clone EMBL-ACC contained the entire 7.3- 
kbp coding region together with flanking noncoding regions 
of the yeast ACC gene. 

Sequence Analysis of the ACC Gene. The restriction map 
and sequence strategy used in the structural analyses of the 
ACC coding region are outlined in Fig. 3. The DNA was 
sequenced by using standard procedures (18). More than 95% 
of the sequence was confirmed by sequencing both strands. 
The remaining sequences were confirmed by sequencing the 
fragments more than twice in the same direction. 

The nucleotide sequence of the DNA encoding the ACC 
and the derived amino acid sequence are shown, in Fig. 4. 
Starting with the first ATG (Met) codon at nucleotide 1, the 
nucleotide sequence has an open reading frame of 6711 bases 
coding for a protein of 2237 amino acids having a molecular 
weight of 250,593. There are no introns in the entire se- 
quence, since the highly conserved intron-specific sequence 
TACTAAC is not present. In all three reading frames the 
nucleotide sequence upstream of the putative initiation codon 



Fig. 2. Northern blot analyses 
of total yeast RNA (10 pig). Lane 
1, yeast RNA hybridized with 0.6- 
kbp £ce>RI fragment prepared 
from Agtllacc; and lane 2, RNA 
hybridized with a 2.8-kbp Hind- 
Bam DNA fragment obtained 
from FASi in YEP33F1 (14). 
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Fig. 3. Restriction map and sequence determination strategy of 
yeast ACC genomic DNA (ACC) cloned in EMBL3a. EcoRl frag- 
ments pi. 5 and p3.5 were subcloned in the pBluescript vector 
(Stratagene). The Sst I fragment from the yeast DNA in Agtllacc was 
isolated and subcloned in PUC 19: Only the restriction sites used for 
sequencing are indicated. Each fragment was sequenced to the extent 
of each of the arrows in the direction shown. Arrows with nothing or 
closed circles at the unpointed ends denote sequencing using internal 
restriction sites and appropriate oligonucleotide primers, respec- 
tively. Hatched bars indicate the coding segment. The bracketed 
areas in p3.5 and Agtllacc-St are the overlapping regions. Restriction 
sites: B, BamH I; Bg, Bgl II; E, EcoRl; Ev, EcoRV; H, Hindlll; He, 
Hindi; K, Kpn I; P, Pst I; St, Sst I; E*, EcoRl linker not in gene. 

terminated (data not shown), indicating that the coding 
sequence can start only with this Met codon or one of the 
internal (downstream) Met codons. When the rat and yeast 
amino acid sequences were aligned, we found that the linear 
homology starts from amino acid residues 126 and 68 in the 
rat and yeast, respectively, allowing only the Met codons 
present in the first 68 amino acids as likely candidates for 
translation initiation sites. Thus, the first Met codon and/or 
a Met a residue 14, both of which have a purine at the -3 
position (19), are probable translation initiation sites, and for 
now, we are considering that it is the first ATG. The open 
reading frame ends at residue 2237 with Leu-Lys, consistent 
with the COOH-terminal sequence of yeast ACC, as reported 
by Lynen (20). We have sequenced a peptide isolated from 
yeast ACC after cleavage with CNBr that exactly matches the 
sequence from amino acid residues 2019 to 2026 (underlined 
sequence in Fig. 4). The conserved biotin-binding site, Met- 
Lys-Met, is located between amino acid residues 734 and 736. 

Protein Structure and Functional Domains. ACCs are biotin 
enzymes that are highly conserved, so much so that antibod- 
ies prepared against rat, chicken, and yeast enzymes cross- 
react with each other (unpublished results). The rat (3) and 
chicken (4) ACCs, which consist of 2345 and 2321 amino 
acids, respectively, are highly homologous (90% identical), 
despite the evolutionary diversity, and, hence, little infor- 
mation can be derived by comparing the two sequences. The 
overall sequence similarity between the rat enzyme and the 
yeast carboxylase is about 34%, which is significant consid- 
ering their evolutionary divergence. Further, along these 
sequences are several stretches of various lengths that are 
about 80% conserved, with some segments even 100%. 

Closer examination of the amino acid sequences of rat and 
yeast enzymes showed other variable regions as well as 
regions of high similarity, as illustrated in Fig. 5. For one, the 
amino acid sequence of the yeast ACC is shorter than that of 
the rat enzyme by 108 residues. The shortage occurs near the 
NH2 terminus, where there are stretches of 50 and 8 amino 
acids missing, and the COOH terminus, where the enzyme is 
20 residues shorter than the rat ACC. There are also notable 
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ATGAGCGAACAAAGCTTATTCGACTCTTCTCCACAGAACATGGAG^ 

u J , f c.wi,^i.,c. rLJ ,.,(.^-nc a rc J irPr n ffi B i y.^r r.UiTyrCl ul 1 eTht / 1 «-^."<;i»rCluArflH i aThrCl tiLauProGl vHi aPhel leGlyLeu 
AATACAGTACATAAACTAGAGCAGTCCCCCTrAAGCCAC^^ 

AsnTJirValAspLysLeuCluGluSerProLeuAxgAspPheValLysSerHi aClyGlyHisThrVa] I leSerLyal leLeuI leAl&AanAanGly 1 leAla 
CCCCICAAAGAAATTAGATCCSTCAGJUUUVTGGCCATACCA^ 

AlaValLyaCluIleArgSerValArgLysTrpAlaTyrGluTOrPheClyAapAspArgThr^^ 
AACGCACAATATATCCCTATGGCCGATCAATACATTCAAGTGCCAGGTCGTACT^ 

AsnA 1 aCl uTV r 1 1 e ArgMe t A 1 aAspC InTyrI led uVa 1 ProC 1 yC 1 yTh r AsnAsnAsnAsnTy rA 1 aAsnVa 1 AspLeu 1 1 eVa 1 Asp 1 1 «A 1 aC 1 uAr g 
GCAGACXrrAGACGCtOTATGGCCTGGCTGGGGTCACGCCTCCC 

A 1 aAspVa 1 AspA 1 a Va 1 TrpA laG 1 ylrpC 1 y H i s A 1 aSe rG 1 uAsn Pro LeuLeu ProG 1 uLy a LeuS e rG inSe r Ly s ArgLysVal IlePhelleGlyPro 

CCACGTAACGCCATGAGGTCTTTAGGTGATAAAATCTCCTCTAC^ 
ProGlyAsnAlaHetArgSeri^uClyAspLysIleSerSerThrU 

GTTCACGTCGACGAGAAAACCCClUUj<Vl^'TtlI&lCC^ 

Va 1 H i eVa 1 AspG luLy sTh rG 1 y LeuVa 1 Se rVa 1 AspAspAsp 1 1 eTy rG 1 nLy sG 1 y Cy sCy sThrSe rPr oC 1 uAspG 1 y LeuGlnLyoAlaLyaArglle 

ClyPheProValHetIleLyflAlaSeiCluQlyaiy01y«lyItf»aiy^ 

CAAftTTCCAGGCTCCCCCATTTTCATCAltyVACTrcGCCGCT 
GluIleProGlySerProIlePheIleMetLysLeuAlaGlyArgAlaArgHisL*u^^ 

GGTAGAGACTCTrcCCTTCAGAGACCTCATCAAAAAATTATCCAAGAAGCACCACTTAC^ 

GlyArgAspCysSerValGlnArgArgHisGlnLysIlelleGluGluAlaProValT^rHeAlaLyaAlaGlu^rPheHisCluMecGluLyBAlaAlaVal 

ACACTGGGGAAACTACTCGGTrATCTCTCTGCCGCTACCGTCGAGTAT^ 
ArgLeuClyLysLeuValGlyTyrValSerAlaGlyThrValGluTyrl^uTyrSerHisA 

GTCGAGCATCCAACAACGGAAATOriXrrcCCGTGTTAACTTACC^ 
ValCluHisProThrThrCluMetValSerGlyValAsnLeuProAlaAlaGln^ 

TTATATXXn-ATCAATCCTCATrcTGCCTCAGAAATCGATTOGAATTCAAA^ 
LeuTyrGlyMetAsnProHisSerAlaSerCluIleAspPheGluPheLys 

GCTTCTCCTATCACATCAGAAGATCCAAACGATGGATTCAAGCCATC^ 
AlaCysArgneThrSerGluAspProAsnAspClyPheLysProSerGlyGly^ 

TCCCTCGGTAACAATOTAATATTCA ClVLl ' l -l-l X- GGACTCrrCAGTTCGGCC 

SerValGlyAsnAsnGlyAsnlleHisSerPheSerAspSerGlnPheGlyHisIlePheAlaPheClyGluAsnArgGlnAlaSerArgLysHisMetValVal 
GCCCTGAAGGAATTCTCCATTAGGGGrGATTrCAGAACTACTC 

A 1 a LeuLy sG 1 uLeuSe r 1 1 eArgG 1 yAspPheArgTh rTh rVa X C 1 uTy r Leu 1 1 e Ly sLeuLeuG 1 uTfcrG 1 uAspPheG 1 uAspAsnThrlleThrThrCly 

TGGTTCGACGATITGATTACTCATAAAATCACCGCTGAAAAGCCT^ 
Trpl*uAspAspI,euIleT*rHisLysMetThrAlaCluLysProAspProTh^ 

GCCCGCCACAAGTATATCGAATCCTTACAAAAGGGACAACTTCT 

A 1 aArgH i s Ly sTyr 1 1 eGl uSer LeuG 1 nLy sG 1 yG 1 nVa 1 LeuSe r Ly sAspLeuLeuG InTh rMe t PheProVa 1 AspPhe 1 1 eH i sG 1 uG 1 y Ly sArgTyr 
AAGTTCACCCTAGCTAAATCCGGTAATGACCCTTACACA™ 

LysPheThrValAlaLysSerGlyAsnAspArgTyrThrLeuPhel le&saQl^SfiiLysCysAspI lei leLeuArgClnLeuSerAspGlyGlyLeuLeuI le 
GCCATAGGCCGTAAATCGCATACCATCTATIGGAAAGAAGAAGTTT^^ 

AlalleGlyClyLysSerHisl^rl leTyrTrpLysGluGluValAlaAlaThrArgLeuSerValAspSerMetThr^rLexiLeuGluValGluAsnAspPro 
ACCCAGTrcCCTACTCCATCCCCTCOTAAATrGG^ 

ThrClnl^uArgThrProSerProClyLysLeuValLysPhel^uValGluAsnGl^^ 

ATGCAAATGCCT i ' llXJ ' 1 'l'll. l^ AAGAAAATGCTATCGTCCAGTTArrAAAGCAACC^ 
MetClr.HetProLeuValSerGlnGluAsnGlylleValGlnLeuLeuLysClnProGlyS 

GATCCATCCAAGGTCAAGCACCCTCTACCATTTGAAGGTATCCTCCCAGAl'l-ril^'l'll. 1VCAGTTATCGAACGAACCAAACCTOCTATAAATTCAAGTCATTA 
AspProSerLysValLysHisAlaLeuProPheGliiGlyMetl^uProAspFheGlySerProVallleGluGlyThrLysProAlaTyrLysPheLysSerLeu 

CTGTCTACTTTGGAAAACATTTTGAAGGGTrATGACAACCA^ 

Val£cr^^^LcuClu^rnneL"'■l^"^ 1 V' ^ l"^ Ae ^ fcer ' r - 1r>V;>^ TlAMgr A^.ATaCgrLeuGlnGlnLeuIlgGluVaH.euArgAsnProLvsLeuProTyr 

TCAGAATCGAAACTACACATCTCTCCTTTACATTCAAGATrGCCTGCT 
SerGluTrpLysLeuHisIleSerAlal>eumsSerArgL<uProAlaLyslxn^ 

TTCCCAGCTAGACAATTA-AGTAAATTGATTGATATGCCCGTCAAGAATC 
PheProAlaArgClnUuSerLysl^uIleAspMetAlaValLysAsnProGluTyrAanP^ 

CCTCATAACTACTCTAACCGGTTAGAAGCCCATGAACATTCTATO 

AlaHisLysTyrSerAsnGlyl^uCluAlaWisGluHisSerllePheValHisPheLeuCluGluTyrTyrCluValCluLysLeuPheAsnGlyP 
CGTGAGGAAAATATCATrcTGAAATTGCGTGATCAAAACCCTAAAGATCTAGA 

Ar gG 1 uG 1 uAsn 1 1 e 1 1 e LeuLy s LeuAr gAspG 1 uAsnPr o Ly sAspLeuA spLy s Va 1 A 1 a LeuTh rVa 1 UuSe r H i sSe r Ly s V alSerAlaLysAsnAsn 
CTGATCCTAGCTATCTTGAAACATTATCAACCATTGTGCA^ 

Leu 1 1 e LeuA 1 a 1 1 e LeuLy s H i sTy rG 1 n Pr oLeuCy s Ly s LeuSe r Ser Ly s Va 1 Se r A 1 a 1 1 e PheSe rTh rPro LeuG 1 nH i s I leValGluLeuGluSer 

AAGGCTACCGCTAAGCTCGCTCTACAAGCAACAGAAATrTTC^ 
LysAlaThrAlaLyaValAlal^uClrJUaArgCluneLeuneGlnGlyAlaL^^ 

IVlTJVl^ltjAAGCTICCCTATGGCTCATCCAA 

SerValValLysValAlaTyrGlySerSerAsnProLysArgSerGluProAspL^uAsnlleLeuLysAspLeuIleAspSerAsnTyrValValPheAspVal 
TTACTTCAATTCCTAACCCATCAACACCCACTnriCAC^ 

LeuLeuC InPheLeuThrH i sG InAspProVa 1 Va 1 ThrA 1 aA 1 aA 1 aA 1 aG 1 nVa 1 Ty r 1 1 eArgArgAl aTyrArgAl aTy rThrl leGlyAspI leArgVal 
CACGAAGGTCTCACAGTrcCAATTCTTGAATCGAAATTCCAACTACCTO 

H i sG luC 1 y Va 1 ThrVa 1 Pr o 1 1 e Va 1 G 1 uTrpLy s PheC InLeuProSerAlaAlaPheSerThrPheProThrValLysSerLysMetGlyMetAsnArgAla 

C iriVmrriV AGATTTGTCATATCTTGCAAACACTCACTC^ 
ValSerValSerAspL*uSerTyrValAlaAsnSerGlnSerSerProL*uArgGlu^ 

TCACAAAGTTTGGAAGTTATTCCTCGTCACCAA1XJT1VT 

SerGlnSerl^uGluVallleProArgHisGlnSerSerSerAsnGlyProAlaProAspArgSerGlySerSerAlaSerLeuSerAsnValAlaAsnValCys 
GrTTXTTCTACAGAAGGTITCGAATCTGAAGAGGAAATTrrGG^ 

ValAlaCj;rThrCluCliThcCliircrCluClnCluIl"T &y 7 i.«iiar ? ninTiaLfluAani^HiA«nT.y«niTrf:iiit^niTiPA<nA] «<!i*ri leArgArg 

ATCACATITATCTOGGTTTTAAAGATGGGTCTTATCCAAAGT^ 
IleThrPheMetPheGlyPheLysAspGlySei^rProLysTyrTyrThrPhe^ 

TTX^CTTCCAACTCGAATTAGGAAGATTGTCCAACTTC 

LeuAl a PheG 1 n LeuG 1 uLeuG 1 yArgLeuSe r A sn PheAsn IleLysProIlePheTh r AspAsnArgAsn IleHisValTyrGluAl aVa lSerLysThrSer 
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.*CATTCX»TAACAGATTCrrTTAC^ 

ProLeuAspLysArgPhePho'rtjrArgClyl lelleArgThrClyHisIleArgAspAspI leSerlleGlnGluTVrUuThrSerGluAlaAsnArgLeuMet 1365 
A GTCATATATTCGATAATTrAGAACTCACCGACACTT^^ 

SerAspIleLeuAspAsnl^uGluValThrAspThrSerAsnSerAsp^ 1400 
GAAGCCGCCTItraTCCTTTCTrAGAAAGATTTGGTAAGA^ 

GluAlaAlaPheGlyGlyPheLeuCluArgPheGlyLysArgLeuLeuArgl^^ 1435 
GCCCCACTACCATTGCCTGCCTIGATCAATAACGTTTCTGGTTATGTTATCA^ 

AlaProValProLeuArgAlaLeuIleAsnA^iiliftJU^ClyTyrVallleLysThrGluMe 1470 
TCTTKXXITAAACCTGGATCCATGCATTrAACACCTATTGCTAC^ 

Ser LeuClyLysProGlySerMetHlsl^uArgProIleAlaThrPro^^ 1S05 
iCATATGTCTATCACTTCCCAGAATTATTCCGCCAAGCATCGTC 

■T^rtyrValTyrAspPheProGluLtfuFheArgGlnAla^^ 1540 
■^AGTTCATIGAAGATCAAAACGGCGAATrAACTGAGGTGGAAAGAGAACCTGGT^ 

G^u^uIleGluAspGluAsnGlyCluLeuThrCluValGluArgGl^^ 1575 
TATCCAAGAGGCCGTCAATmrrTCTTGrroCTAACGATATCACATTCA^ 

iy r ProArgG 1 yArgG 1 nPhe Va 1 Va 1 Va 1 Al aAsnAspI 1 el* r PheLy s 1 1 eG 1 y Ser PheG 1 y ProG 1 nG 1 uAspG 1 uPhe PheAsn Ly sVa 1 Th rG 1 uTy r 1 6 1 0 
GCTAGAAAGCGTGGTATCCCAAGAATTTACTTGCCTGCAAACTC 

AlaArgLysArgGlyllaProArglleTyrLeuAlaAlaAsnSerGlyAlaArglleGlyMetAlaGluGluIleValProLeuPheGlnValAlaT^ 1645 
GCTCCCAATCCGGACAAGGGCTTCCAATACTTATACTTAACAAGTGAA^ 

AlaAlaAsnProAspLysGlyPheClnTyrLeuTyrl^uThrSerGluGlyMetGlu^ 1680 
GT^ATAAACGGTGAAGAAAGATTKJTCATCAAGACAATTATIGGTTCTC 

VallleAsnGlyGluCluArgPheVallleLysThrllelleGlySerGluAspGlyLeu^^ 1715 
TCAAGGGCTTACCACCATATCrrcACTATCACCTTAGTC^ 

SerArgAlaTyrHisAspIlePheThrlleThrLeuValThrCysArgSer^ 1750 
GGCCAGCCAATTATTKXITATCGGTGCTrATTAACTGGTGCTCCTCAATC 

GlyGlnProIlelleTrpTYrArgCysLeuLeuThrGlyAlaProGl^^ 1785 
ATCATCTATAACAACGCTCTTTCACATTTCACTGCTGTO 

IleHetTyrAsnAsnGlyValSerHisLeuThrAlaValAspAspI^uAlaGly^ 1820 
CCAGTTCCTATCTTGGAAACTAAAGACACATGGGATAGACCAGTrcATTTCACTCCAACTAAT^ 

ProValProIleUuGluThrLysAspThrTrpAspArgProValAspPheTh^^ I 855 
ACTGAAAGTGGATTTGAATATCGTTTGTITGATAAAGGGTC 

ThrCluSerClyPheGluTyrGlyLeuPheAspLysGlyflorPheFheOluTl^^ 1890 
ATTCCACTCGGTCrTATTGGTGTTGAAACAAGAACTGTC 

IleProLeu(UyValIleGlyValGluThrArgThrValGluAsn^ 1925 
CAAGTTIGGCATCCAAACTCCGCCTrcAAGACTGCTCAAGCTATC 

GlnValTrpHisProAsnSerAlaPheLysThrAlaGlnAlalleAsnAspPheAsnAsnGlyGluGlnLeuProMetMetlleLeuAlaAsnT^ I960 
TCTTXJTXXnXTAACGTCATATCTTCMCGAAGTCTTCAAGTATGGrrCG^ 

SerGlyGlyGlnAxgAspMecPheAsnGluValLeuLysTyrGlySerPhelleValAspAlal^uValAspTyrLysClnProIlellelleTyrlleProPro 1995 
ACCGGTCAACTAAGAGGTGGTrCATGGGTrcTTGTCGATCCAACTATC 

ThiSlyGluI^uArgGlyGlySerTrpValValValAspPro^ 2030 
CCACAAGGTATGGTroGTATCAAGTTCCGTAGAGAAAAATroCTGGACACCATGAACAGAT^^ 

ProGlnGlyMetValGlylleLysPheArgArgGluLysLeuI^uAspThrMetAs^ 2065 
AGTTIGGCTCCACAACTACATCAGCAAATATCCAAGCAAmGCT 

SetLeuA 1 aProG 1 uVa 1 H i sG InG 1 n 1 1 eSe rLy sG 1 nLeuAl aAspAr gG 1 uAxgG 1 uLeuLeuPro 1 1 eTy rGlyGlnll eSe r LeuG In PheA 1 aAspLeu 2100 
CACGATAGCrrcTTCACCTATGGTXXKCAAGGCTGTTAm 

HisAspArgSerSerArgMetValAlaLysGlyVallleSerLysGluI^uGluTrpThrGluAlaArgAxgPhePhePheTrpArgLeuArgArgArgLeuAsn 2135 

gaac^tatttcattaaaaggtix^gccatcaggtaggcgaagcatcaaga™ 

GluGluTyrUnilleLysArgLeuSerHisGlnValGlyGluAlaSerAr^^ 2170 
GATGATAGGCAAGTCGCAACATtWATTGAAGAAAACTACAAAACTTTGGACGA 

AspAspArgGlnValAlaThrTrpUeGluGluAsnTyrLysThrl^uAspAspLysI^^ 2205 
ATCAGAAGCGACCATGACAATGCTATTGATO^TTATCTGAAGrrATC 

I leArgSerAspHisAspAsnAlal UAspGlyLeuSerGluVallleLysMetLeuSerThrAspAspLysGluLysLeuLeuLysThrLeuLys * 2237 



Trans - 
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Fig. 4. Nucleotide sequence 
of the gene coding for ACC and 
the predicted amino acid se- 
quence of the protein. Number- 
ing of the nucleotides starts with 
the A of the first ATG Met 
codon. Numbering is shown 
only for the amino acids. Under- 
lined amino acid sequences indi- 
cate protein sequences that have 
been verified by amino acid se- 
quencing and the glycosylation 
sites. The amino acid sequences 
in bold letters indicate the puta- 
tive nucleotide-binding motif, 
biotin-binding site, and the pu- 
tative CoA-binding site. 



internal deletions of 17 and 18 amino acids near residues 1249 
and 1328, respectively. Other shorter deletions throughout 
both yeast and rat ACC make up the overall difference. A 
surprising lack of homology occurs between the sequence of 
the first 100 amino acids of rat, the region of the rat enzyme 
that can be phosphorylated in vitro by six protein kinases (5), 
and that of the yeast enzyme. In rat ACC, phosphorylation of 
Ser-77 and Ser-79 by cAMP-dependent and AMP-activated 
protein kinases decreases carboxylase activity (lower V max ) 
and increases citrate concentration required for half-maximal 
activation (5). Although there are several serine residues near 
the NH 2 -terminai region of yeast ACC, whether or not one or 
more of these serine residues are phosphorylated remains to 
be determined. Other proposed phosphorylation sites on the 
rat ACC are Ser-1200, phosphorylated by both cAMP and 
AMP-dependent protein kinases, and Ser-1215, phosphory- 
lated by the AMP-activated kinase (5). Phosphorylation at 
these sites may be important in the regulation of ACC activity 
(21). Similarly, the yeast ACC has serine residues in the 
equivalent peptide region (residues 1200-1220), and we sus- 
pect that phosphorylation of a serine residue in this region 
may result in decreased activity (K. V. Venkatachalam. 
W.-Y. Huang, and S.J.W., unpublished results). 



The polypeptide segments of the yeast ACC that exhibit 
high amino acid sequence similarity to those of rat enzyme 
vary in length and occur in three subdomains along the 
protein (Fig. 5). The first subdomain, which spans the region 
near the NH 2 terminus (residues 100-600) and exhibits about 
50% identity to corresponding sequences of rat enzyme, 
could be the putative biotin carboxylase component of yeast 
ACC because of its high sequence similarity to corresponding 
amino acid sequences of known biotin enzymes such as yeast 
pyruvate carboxylase (residues 162-355), and the a subunit of 
human propionyl-CoA carboxylase (residues 177-375). 
Moreover, within the yeast ACC subdomain, the region 
between amino acid residues 235 and 392 is highly homolo- 
gous to a corresponding segment of rat ACC and may contain 
the ATP and HCOf binding sites. This conclusion was based, 
in part, on the presence of the Gly-rich motif of Gly-Xaa- 
Gly-Xaa-Xaa-Gly or Gly-Xaa-Xaa-Gly-Xaa-Gly, which have 
been suggested as the nucleotide-binding motif for yeast and 
rat carbamoyl-phosphate synthases and rat ACC (3, 22). 
Further, Hamada et al. (23) have proposed that the sequence 
Tyr-GIy-Tyr-Thr-His-Leu-Ser-Thr-Gly in rabbit muscle my- 
okinase (residues 32-40) is involved in the binding of 
MgATP. Similar sequences are found within this subdomain 
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Fig. 5. Dot-matrix plot of rat vs. yeast amino acid sequences of 
ACC. 

of yeast ACC (residues 366-373), Tyr-Leu-Tyr-Ser-His-Asp- 
Asp-Gly, and in rat ACC (residues 424-432), Tyr-Leu-Tyr- 
Ser-Gln-Asp-Asp-Ser; however, their significance in nucle- 
otide binding remains to be determined. 

The conserved biotin-binding site Met-Lys-Met is located 
at residues 734-736 in yeast ACC, which represents the 
region of biotin carboxyl carrier protein, the second subdo- 
main. Comparison of amino acid sequences near the biotin- 
binding sites with all known biotin-containing enzymes 
showed that (i) the Met-Lys-Met sequences of yeast and 
animal ACCs are preceded by Val instead of Ala, as in other 
carboxylases, and (ii) these sequences are located closer to 
the NH 2 termini of the molecules, whereas in other biotin- 
containing enzymes this sequence occurs near the COOH 
termini. It has been suggested that this positioning of the 
biotin-binding site may increase the efficiency of the bioti- 
nylation of the proteins (24). In all the biotin-containing 
enzymes, the biocytin residue is located 25-29 amino acids 
downstream from a short amino acid sequence flanked by 
two Pro residues [-Pro-(Xaa)„-Pro-), which might act as a 
hinge to permit the biotin-containing arm to move between 
the carboxyl donor and acceptor sites (2). Yeast, rat, and 
chicken ACCs also contain similar sequences (Fig. 4). 

Despite the conservation of biotin-binding sites -Met-Ly s- 
Met- among all biotin-containing proteins, the amino acid 
sequences surrounding these sites are divergent. Indeed, the 
sequence of yeast ACC between residues 600 and 1700 is least 
homologous (26%) with that of the rat ACC (Fig. 5). The lack 
of conservation of the sequences in these regions of the 
enzymes may pertain to the assembly of the enzyme subunits 
into polymer forms that make up the enzymatically active 
carboxylases. Also, this lack of conservation suggests that 
the amino acid sequences within BCCP domains of the 
carboxylases may be involved only in providing a scaffold for 
the critical regions of the structure to function. In this regard, 
the biotin in the BCCP domain may be akin to the 4'- 
phosphopantetheine prosthetic group of the acyl carrier 
protein domain of the fatty acid synthase, where the quater- 
nary structure of the protein and the need for the pantetheine- 
bound fatty acyl intermediates to interact with the various 
catalytic domains of the synthase play a crucial role in the 
overall activity of the enzymes. 

The third subdomain of the yeast ACC, residues 1700- 
2100, is highly homologous (60%) to the corresponding seg- 
ment of the rat ACC, residues 1650-2200 (3). Within these 
subdomains, sequences between amino acid residues 1870 
and 1890 are highly similar to the proposed sequence of the 
"adenine recognition loop" of porcine and yeast citrate 



synthase (25, 26) and to the 0 subunit of human propionyl- 
CoA carboxylase (27). Hence, these peptide segments in 
ACC are likely to be components of the CoA-binding site. 

Recently, Bowers and Allred (28) reported that the rat liver 
ACC is a glycoprotein. In the yeast carboxylase sequence, 
there are nine sites that can be N-glycosylated, as indicated 
in Fig. 4. However, the presence of carbohydrates in the 
carboxylase and the involvement of any of the putative 
glycosylation sites remain to be investigated. 
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ABSTRACT Acetyl-CoA carboxylase [ACCase; acetyl- 
CoA:carbon-dioxide ligase (ADP-fonning), EC 6.4.1.2] cata- 
lyzes the ATP-dependent carboxylation of acetyl CoA to pro- 
duce malonyl CoA. In plants, malonyl CoA is needed for plastid 
localized fatty acid biosynthesis and for a variety of pathways 
in the cytoplasm including flavonoid biosynthesis. We have 
determined the fuU nucleotide sequence of an ACCase from 
alfalfa, which appears to represent a cytoplasmic isozyme. 
Partial cDNAs were isolated from a cDNA library of suspension 
culture cells that had been elicited for isoflavonoid phytoalexin 
synthesis. The full-length sequence was obtained by primer 
extension and amplification of the cDNA with synthetic prim- 
ers. The sequence codes for a protein of 2257 amino acids with 
a calculated M T of 252,039. The biotin carboxylase, biotin 
carboxyl carrier protein, and carboxyltransferase domains, 
respectively, show approximately 72%, 50%, and 65% se- 
quence similarity to those of animal, diatom, and yeast ACCase 
sequences. ACCase enzyme activity and transcripts are in- 
duced severalfold upon addition of yeast or fungal elicitors to 
alfalfa cell cultures. 

Acetyl-CoA carboxylase [ACCase; acetyl-CoAxarbon-di- 
oxide ligase (ADP-forming), EC 6.4.1.2], a biotin-containing 
enzyme, catalyzes the first step in fatty acid biosynthesis via 
ATP-dependent carboxylation of acetyl CoA to form malonyl 
CoA. This reaction has been implicated as a key regulatory 
step in de novo fatty acid biosynthesis in the plastids of plant 
leaves (1-3) and in the cytosol of animal cells (4). 

A single ACCase polypeptide has been purified from 
several higher plants, algae, and animals. However, two 
ACCase isozymes differing in their sensitivity toward the 
herbicides haloxyfop and sethoxydim have recently been 
identified in maize (5). In addition to providing malonyl CoA 
for plastid fatty acid synthesis, additional ACCase isozymes 
may be needed in the cytosol to supply malonyl CoA for the 
synthesis of very long chain fatty acids (6), flavonoids (7, 8), 
isoflavonoid phytoalexins (9), and stilbenes (10), and in the 
inactivation of the ethylene precursor 1-aminocyciopropane 
1-carboxylate (11). ACCase activity is induced in UV- 
illuminated parsley cultures (7) and elicitor-treated soybean 
cultures (8) at the onset of accumulation of flavonoids and 
isoflavonoid phytoalexins, respectively. The increased ACC- 
ase activity in these cultures is presumed due to a higher 
demand for malonyl CoA for the chalcone synthase reaction. 

ACCase has been cloned from Escherichia coh (12-15), 
chicken (16), rat (17), yeast (18), and the diatom Cyclotella 
cryptica (19). Recently, two subumts of Anabaena sp. 
ACCase have been cloned (20). The full-length sequence of 
a higher plant ACCase has not previously been reported. The 
E coli ACCase is composed of four separate proteins: the 
biotin carboxyl carrier protein (BCCP), the biotin carboxyl- 
ase (BC), and two subunits of carboxyltransferase (CT). In 
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contrast, mammalian and yeast ACCases contain the BCCP, 
BC and CT as functional domains within a single polypeptide 
with an apparent mass of 200-250 kDa. In plants, both 
multisubunit (21-23) and multifunctional polypeptide forms 
of ACCase (3) have been reported. 

We report here the isolation of alfalfa ACCase cDNAs 
covering the complete ACCase open reading frame, which 
exhibits 60-63% sequence similarity and 40-42% sequence 
identity at the amino acid level to chicken, rat, yeast, and C. 
cryptica ACCases. We also show that treatment of alfalfa cell 
suspension cultures with fungal or yeast elicitors results in 
substantial induction of ACCase activity and transcript lev- 
els. Based on the absence of a transit peptide in the sequence 
and the induction of expression by elicitors, we suggest that 
the alfalfa ACCase clone codes for a cytoplasmic isozyme.* 

MATERIALS AND METHODS 

Growth and Elicitation of Alfalfa Cell Cultures. Cell sus- 
pension cultures of alfalfa (Medicago sativa L.) cv. Apollo 
were initiated and maintained as described elsewhere (24). 
Dark-grown cells were treated at 5 days after subculture with 
elicitor preparations from the cell walls of Colletotnchum 
lindemuthianum (24) or yeast (25) at final concentrations of 
50- ug glucose equivalents per ml of medium. 

Extraction and Assay of ACCase Activity. Alfalfa ceU sus- 
pension cultures (1 g) were homogenized with a Polytron on 
ice in 2 ml of 100 mM Mes, pH 6.5/2 mM dithiothreitol. The 
supernatant was brought to 45% saturation with solid 
(NH 4 ) 2 S0 4 and left on ice overnight. The precipitated protein 
was resuspended in 400 pi of extraction buffer and 30 /d of 
the extract was used immediately for ACCase assay (based 
on the formation of acid-stable radioactive malonyl CoA from 
H 14 C0 3 and acetyl CoA) as described elsewhere (21). Assays 
lacking acetyl CoA were included for each cell extract as a 
control. Protein content was determined in duplicate by the 
Bradford assay (26) using bovine serum albumin as a stan- 
dard. 

Construction and Screening of an Alfalfa Genomic Library. 
Genomic DNA was isolated from alfalfa leaves (cv. Apollo), 
partially digested with BamHh filled in with dGTP and 
dATP, and ligated to Lambda FIX ll/Xho I filled in with 
dATP'and dGTP as described by the supplier (Stratagene). 
The ligated product was packaged with a Gigapack II Gold kit 
and amplified in the host strain PLK17 (Stratagene). The 
library was plated with the host P2PLK17 and screened with 
the 3'ACC probe by plaque hybridization at 42°C as de- 
scribed (27). Positive plaques were amplified in PLK17 and 
purified as described elsewhere (28). A region of the genomic 

Abbreviations: ACCase, acetyl CoA carboxylase; BCCP, biotin 
carboxyl carrier protein; BC, biotin carboxylase; CT, carboxyltrans- 
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5The sequence reported in this paper has been deposited in the 
GenBank data base (accession no. L25042). 
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DNA insert was amplified by PCR using universal primer 
(CAU) 4 T3, which anneals to the Lambda FIX II arm, and 
primer 180 (described below). The amplified genomic DNA 
was annealed to the pAMPl vector and sequenced as de- 
scribed below. 

Screening of an Alfalfa cDNA Library. A cDNA library was 
constructed in Lambda ZAP II starting with poly(A) RNA 
isolated from alfalfa cell suspension cultures treated with 
fungal elicitor for 2, 3, and 4 h. A cDNA fragment (585 bp) 
encoding a putative alfalfa ACCase was identified by its 
sequence (see Results). This fragment was used to screen the 
library as described (27). To find additional cDNAs overlap- 
ping the 5' end of characterized ACCase cDNAs, probes 
corresponding to 200-300 bp at the 5' ends of the identified 
cDNAs were generated by PCR (27) and used to further 
screen the cDNA library. 

5' Rapid Amplification of cDNA Ends (RACE) PCR for 
Isolation of Sequences Corresponding to the 5' End of the 
ACCase Transcript. First-strand cDNA synthesis by primer 
extension. Total RNA (10 /xg) was isolated from cell suspen- 
sion cultures treated with yeast elicitor for 5 h and mixed with 
200 ng of 30-mer primer corresponding to the 5' end of clone 
Tl (Fig. 1). The RNA/primer mixture was denatured at 65 6 C, 
annealed at 42°C, and reverse-transcribed in a total vol of 20 
/ii using M-MLV (BRL) at 42°C in the presence of 10% 
dimethyl sulfoxide. The RNA was hydrolyzed with 2 pH of 2 
M NaOH at 65°C for 1 h and then neutralized with 2 pi of 2 
MHC1. 

Poly(A) tailing and second-strand cDNA synthesis. Four 
of the first-strand cDNA synthesis reaction mixtures were 
pooled and the cDNA was purified into 50 pi of H 2 0 using a 
GeneClean II kit (Bio 101). The cDNA (50 pi) was tailed at 
its 3' end with poly(A) in a final vol of 100 pi using 75 units 
of terminal deoxy nucleotidyltransferase (BRL) at 37°C. The 
second strand was synthesized with 2.5 units of Taq DNA 
polymerase in a total vol of 100 pi containing 50 pi of 
poly(A)-tailed cDNA, 200 pM each dNTP, 1 pM anchor 
primer 134 (AAGCTTCTGCAGGAGCTCTTTTTTTTT- 
TTTTTTTT), and 3 mM MgCl 2 . The mixture was incubated 
at 95°C for 3 min, 25°C for 5 min, 30°C for 10 min, 40°C for 
15 min, and 72°C for 30 min. 

Nested primers used for amplification. The second-strand 
cDNA reaction mixture (10 pi) was amplified at 50°C anneal- 
ing temperature with 20-mer primers that included at their 5' 
end the base sequence CAUCAUCAUCAU or CUACUA- 
CUACUA to facilitate cloning into the pAMPl vector. The 
first PCR product was generated by primers 135 (based on 
primer 134 above) and 136 (corresponding to the 5' end of 
clone Tl). The second amplification was with primers 141 
(corresponding to the 5' end of the cDNA amplified with 
primers 135 and 136) and 142 (designed to encode the peptide 
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Fio. 1. Schematic representation of alfalfa ACCase sequences. 
Nucleotide sequence position (in kb) and partial restriction map sites 
are indicated. Regions used as probes for Southern and Northern blot 
analysis are indicated by solid bars (5T, 3'ACC). cDNA clones are 
represented by open bars. PCR-generated cDN As are represented by 
hatched bars. Numbers above each hatched bar represent the pair of 
primers used to generate the PCR product. H f /ff/idlll; X, Xba I; E, 
EcoRV. 



EGGGGKG with codon usage based on the earlier charac- 
terized alfalfa ACCase cDNA sequences). The third ampli- 
fication was with primers 146 (corresponding to the 5' end of 
the cDNA amplified with primers 141 and 142) and 147 
(designed to encode the peptide KVLIANNG). The peptides 
EGGGGKG and KVLIANNG are conserved between rat, 
chicken, and yeast ACCases. The fourth amplification used 
primer 209 (based on genomic ACCase sequences) and 180 
(corresponding to the 5' end of the cDNA amplified with 
primers 146 and 147) (Fig. 1). 

Isolation and Sequencing of DNA Templates. Plasmids iso- 
lated from the alfalfa cDNA library were subjected to double- 
strand sequencing of both strands using dideoxynucleotide 
chain termination (29) with T7 DNA polymerase (United 
States Biochemical). Each PCR-generated clone was isolated 
from three independent colonies and was fully sequenced on 
both strands using universal and synthetic primers (30) and 
Taq DNA polymerase with an ABI robotic catalyst and 
model 373A DNA sequencer. In some cases, the PCR prod- 
uct was directly subjected to sequence analysis. The align- 
ment was determined by using the gap program of Genetics 
Computer Group (University of Wisconsin, Madison). 

Northern and Southern Blot Analysis. Total RNA (15 p%) 
was isolated from alfalfa cell suspension cultures (31), sep- 
arated on 1% agarose containing 2.2 M formaldehyde, and 
blotted onto nitrocellulose membranes. The blot was hybrid- 
ized at 42°C in the presence of 50% formamide as described 
elsewhere (32). RNA blots were tested for equal loading by 
probing with alfalfa 25S rRNA or Gl cDNA probes (32, 33) 
(data not shown). High molecular weight genomic DNA was 
isolated from plant leaves (34), digested with restriction 
endonucleases, electrophoresed in 0.7% agarose, and blotted 
onto a GeneScreenP/j/5 nylon membrane with 0.4 M NaOH. 
Blots were hybridized and washed at 65°C or 55°C as de- 
scribed elsewhere (32). 

RESULTS 

Isolation of Alfalfa ACCase cDNA Clones. An alfalfa cDNA 
clone designated Gl <32), isolated by screening an alfalfa 
cDNA library with a human protein disulfide isomerase 
probe, was found to be fused through its poly (A) tail to an 
unrelated 585-bp cDNA fragment. This fragment (3'ACC) 
(Fig. 1) encoded a polypeptide with 40% amino acid sequence 
identity and 77% sequence similarity to the chicken ACCase 
in their overlapping regions. Rescreening of the alfalfa cDNA 
library (see Materials and Methods) led to the isolation of 
three clones: Q7 (2567 bp), M2 (4200 bp), and Tl (1467 bp) 
(Fig. 1). When aligned, the three clones covered 5.3 kb of the 
ACCase coding region and their overlapping regions were 
100% identical. The cDNA library lacked clones containing 
the 5' coding region of the putative alfalfa ACCase. This 
region of the ACCase sequence was therefore obtained by the 
5' RACE PCR method. This led to the isolation of four 
additional clones designated 135/136 (625 bp), 141/142 (835 
bp), 146/147 (700 bp), and 209/180 (351 bp) (Fig. 1). 

Primary Sequence of Alfalfa ACCase. Alignment of all the 
above alfalfa ACCase cDNA clones yielded a sequence of 
7194 bp with 69 and 353 bp of 5' and 3' untranslated regions, 
respectively. The 3' untranslated region contained two po- 
tential polyadenylylation signals (A ATA A) starting at posi- 
tions 7113 and 7124 (data not shown). Of 7 cDNA clones 
sequenced at the 3' untranslated ends, 1 was polyadenylyl- 
ated at position 7171, 4 at position 7175, and 2 after addition 
of the sequence TTTTAT to position 7175. 

The coding sequence of 6771 bp contains a methionine 
residue, which initiates the longest open reading frame of 
2257 amino acids encoding a protein of M r 252,039 (Fig. 2). 
This methionine residue has a nucleotide context similar to 
the Kozak initiator methionine consensus sequence (35) and 
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is preceded by a cDNA sequence rich with stop codons in all 
three reading frames. Based on sequence alignment with 
other known ACCases, the putative alfalfa ACCase protein 
contains three functional domains arranged, starting from the 
N-termina! end, in the following order: BC, BCCP, and CT 
(Fig. 2). A putative biotin binding site at a lysine residue 
flanked by two methionine residues starts at position 712 in 
the alfalfa ACCase protein (Fig. 2). The two proline residues 
present 27 and 35 residues upstream of the putative biotin 
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binding site in alfalfa ACCase (Fig. 2) may be involved in the 
formation of a hinge region for the movement of carboxybi- 
otin (36). Putative acetyl CoA and carboxybiotin binding sites 
in the alfalfa ACCase have 80% and 60% sequence identity to 
the corresponding regions in the rat ACCase (Figs. 2 and 3). 
Also, a putative ATP-binding site in the alfalfa ACCase 
matches the consensus sequences Gly-Xaa-Gly-Xaa-Xaa- 
G!y or Gly-Xaa-Xaa-Gly-Xaa-Gly shared by nucleotide bind- 
ing proteins (37, 38) and has 86% identity with the corre- 
sponding region in the rat ACCase (Figs. 2 and 3). The 
sequence Glu-Leu- Asn-Pro-Arg located at positions 354-358 
in the BC region -of alfalfa ACCase (Fig. 2) is similar to the 
consensus sequence Glu-Met-Asn-Pro-Arg, which was pro- 
posed to be a candidate for the catalytic site of biotin- 
dependent carboxylases and carbamoyl phosphate syn- 
thetases (39). 

Genomic Organization of ACCase. Two bands with differ- 
ent hybridization intensities were detected on probing alfalfa 
genomic DNA with the 3 # ACC cDNA (Fig. 1) at high- 
stringency hybridization and washing conditions (65°C) (Fig. 
4A). Southern blot analysis using mixed alfalfa 3'ACC and 
5'T cDNA probes (Fig. 1) detected, at stringent hybridization 
and washing conditions (55°C), only one band in Arabidopsis 
(Fig. 4C), approximately two bands in rice and wheat (Fig. 
45), and three or four bands in pea, peanut, soybean, and 
tobacco (data not shown). 

Elicitation of ACCase. The specific activity of ACCase in 
alfalfa suspension cells increased in response to treatment 
with yeast elicitor, reaching a maximum 4- to 5-fold increase 
between 8 and 12 h postelicitation (Fig. 5). The average 
specific activities of control and induced cells between 8 and 
12 h postelicitation were 5.3 and 23.4 nmol of acetyl CoA per 
mg of protein per min, respectively. 

ACCase transcripts were not detected in unelicited alfalfa 
cell suspension cultures. However, transcripts were signifi- 
cantly induced within 2 h of exposure to fungal or yeast 
elicitor (Fig. 6). ACCase transcripts reached maximum levels 
at 6 h and declined rapidly by 8 h after treatment with the 
fungal elicitor (Fig. 6A). In contrast, transcript levels re- 
mained elevated beyond 8 h in yeast elicitor-treated cultures 
(Fig. 6B). 

DISCUSSION 

Possible Role for Multiple ACCase Enzymes in Plants. In 

most organisms, the major role of ACCase is production of 
malonyl CoA for fatty acid synthesis. In plants, the biochem- 



Alfllf* 

C. cryptic* 

Yrast 

fUt 

E. coll BC 



Acetyl -CoA 

Alfalfa 

C. cryptic* 

Yeast 

Rat 

£. coll m-CT 



G T f A K't U I V 0 0 G 6 t 0 I I K 7 240 
EKC 1 I 1 U S I 0 G 0 0 K 0 I E F V 306 

0 F P V « I K i I E G C 0 0 1 6 I 1 0 f 26S 

01 rvmiASEGoootonn 323 

6TFV1XKASCQCGQR0MRVV172 



a r t rt t 0 »** k roTm*:W 1903 

CKSWlOiCRlflOIPH Q ~ A ~ Tk 1?77 
1KC UVG&l RLGOULOV 1 C 1897 
A Q T V V V C & A R L C 0 X Q V V"A 1986 
D K ATV C CT* R t"D"C R ty H t I G 117 



C;*rfroryblotin 

Alfalfa 

C. cryptic* 
Yeast 
Rat 

E. coll B-CT 



Alfalfa 

C. cryptic* 

£. coil fl-CT 



ort i l »'? s ruHrr* a rvro r » b a * uu 

GIO VV 91 VBDVTV'OS 9B JOV El D E V 1500 
OIQ f VVUIl I Tflll GSFGFQEDEr 1603 
ORDV1VICHDITYRICSPCPQBDLL 1697 
CKPVVXAAF'EFAIMGQ S"M 0 S V V C A R UO 




Fig. 2. Alignment of alfalfa and rat ACCase amino acid se- 
quences. Biotin binding site is indicated by asterisk. Putative binding 
sites for ATP (amino acids 221-240), acetyl CoA (amino acids 
1884-1903), and carboxybiotin (amino acids 1594-1643) are shaded. 



Fic. 3. Amino acid sequence alignments of putative ATP, acetyl 
CoA, and carboxybiotin binding sites from several ACCases. Resi- 
dues identical to the putative alfalfa sequence are shaded. a-CT, a 
subunit of CT; 0-CT, 0 subunit of CT. 
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Fig. 4. Genomic organization of ACCase in alfalfa cv. Apollo and 
Regen SY (RSY), rice, wheat, and Arabidopsis (Arab). Genomic 
DNA (10 fig) was digested with £coRV (E), HMIU (H), and Xba I 
(X), or mixtures of these enzymes, resolved on a 0.7% agarose gel, 
and blotted to GeneScreenP/wj. Blot A was probed with the 3'ACC 
cDNA. Blots B and C were probed with a mixture of the 3'ACC and 
5'T cDNAs. Numbers on left are kb. 

ical fate of malonyl CoA is considerably more diverse. 
ACCase activity has been reported in plastid preparations 
from numerous species and it is likely that malonyl CoA is 
used almost exclusively for de novo fatty acid synthesis in 
this organelle (3). Outside the plastid, malonyl CoA enters 
into several pathways (7-9, 11, 40), including the elongation 
of fatty acids (6). Production of malonyl CoA for these 
diverse roles could be accomplished by multiple ACCase 
isozymes in different subcellular compartments or, alterna- 
tively, by a single enzyme accompanied by transport of 
malonyl CoA or malonate across membranes. 

Early observations on fatty acid elongation in oilseeds 
strongly implicated the participation of multiple ACCase 
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Fig. 5. Elicitation of ACCase activity in alfalfa cell suspension 
cultures in response to yeast elicitor glucose equivalents per 
ml of medium) (+ elicitor) or an equivalent volume of water (control). 
Bars represent the spread of values from duplicate elicitor-treated 
cultures. 



Fig. 6. Expression of ACCase transcripts in total RNA (15 pig) 
isolated from alfalfa cell suspension cultures at the indicated times 
(hours) after exposure to Colletotrichum elicitor (50-jtg glucose 
equivalents per ml of medium) (lanes +) or an equivalent volume of 
water (lanes -) (A) or yeast elicitor (50- /xg glucose equivalents per 
ml of medium) {B). Blot A was probed with the 3'ACC cDNA. Blot 
B was probed with a mixture of 3'ACC and 5T cDNAs. 

enzymes in the synthesis of very long chain fatty acids (41). 
Recently, further evidence for more than one form of 
ACCase has been obtained from maize leaves, where both 
herbicide-tolerant and -sensitive isozymes could be resolved 
(5, 42), and from pea chloroplasts (23), where a transcarbox- 
ylase subunit of ACCase has been characterized. Thus, 
although the current evidence is indirect, the most likely 
interpretation of these data is that plants contain ACCase 
isozymes in both the plastid and the cytosol. 

Two major lines of evidence suggest that the cDNA 
described in this work represents a cytoplasmic form of 
ACCase. First, the N-terminal amino acid sequence does not 
appear to code for a chloroplast transit peptide or for any 
other signal sequence. All nuclear-encoded plastid proteins 
contain an N-terminal extension of ==50 amino acids, which 
directs transport of these proteins into the plastid. When 
aligned, rather than containing an N-terminal extension, the 
N-terminal sequence of the alfalfa BC domain is =20 and =80 
amino acids shorter than the yeast and chicken BC domains, 
respectively. In addition, a 16-residue sequence, which is 
highly conserved with all other eukaryotic cytoplasmic 
ACCases, begins 36 residues into the alfalfa sequence. Sec- 
ond, ACCase activity and transcripts are strongly induced by 
treatment of the alfalfa cell cultures with yeast or fungal 
elicitors. This induction is characteristic of the chalcone 
synthase enzyme, which requires malonyl CoA as a cosub- 
strate for biosynthesis of flavonoid in the cytoplasm (7-9). 
Direct demonstration of the subcellular location of the pro- 
tein associated with this ACCase cDNA will await immuno- 
cytochemical analysis or chloroplast uptake studies of in vitro 
translated ACCase cDNA. 

At this time it is not clear how closely related are the 
cytoplasmic and plastid forms of ACCase. Partial cDNA 
sequences have been reported for maize and wheat ACCase 
in the GenBank data base but it is not certain whether these 
code for plastid isozymes. These monocot sequences have 
=65% identity (80% similarity) with the dicot alfalfa ACCase 
in their overlapping regions. In contrast, we have sequenced 
an Arabidopsis genomic clone for ACCase that is >80% 
identical (89% similar) to alfalfa in amino acid sequence (43). 
More detailed sequence comparisons indicate that all the 
available plant ACCase sequences have local regions of very 
high identity and therefore heterologous probes at low strin- 
gency may cross-hybridize between them. For this reason, 
hybridization strategies to determine tissue or developmental 
expression patterns may need to account for potential con- 
tributions from a mixture of ACCase mRNA species. 
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Genomic Organization. The detection of two bands in 
Southern blots with different hybridization intensities sug- 
gests that ACCase in alfalfa is encoded by at least two related 
genes. Southern blot analysis of alfalfa ACCase-related genes 
in other plants suggests that Arabidopsis, rice, and wheat 
ma y have one or two corresponding genes in their genome. 
At this time it is not possible to determine whether genes 
coding for both cytoplasmic and plastidial forms of ACCase 
would be detected by the alfalfa probe in these blots. Since 
the nucleotide sequences of >71 independently isolated 
alfalfa ACCase cDNAs were identical in their overlapping 
regions, both in the coding and in the untranslated regions, it 
is likely that all characterized cDNAs isolated from the 
elicitor-induced alfalfa library correspond to one gene. 

Elicitor Induction of ACCase Transcripts and Activity. The 
elicitor-induced increase in ACCase transcript levels (Fig. 6) 
precedes the increase in enzymatic activity (Fig. 5), and this 
timing correlates with the induction kinetics of transcripts/ 
activities of the series of enzymes involved in the synthesis 
of the isoflavonoid phytoalexin medicarpin from phenylala- 
nine (9). It is possible that most of the undetectable ACCase 
message in unelicited, control alfalfa cell cultures is associ- 
ated with the plastidial form, which may not cross-hybndize 
with the cytosolic ACCase probe (Fig. 6). 

Elicitation of isoflavonoid phytoalexin biosynthesis in le- 
gume cell suspension cultures requires three molecules of 
malonyl CoA as cosubstrate with 4-coumaroyl CoA in the 
chalcone synthase reaction (9). Thus, ACCase is coinduced 
with chalcone synthase in elicited alfalfa and soybean (8) cell 
suspensions, suggesting that insufficient malonyl CoA is 
produced under normal conditions to satisfy the require- 
ments for both flavonoid synthesis and fatty acid elongation. 
ACCase is also induced by UV irradiation in parsley cells, 
associated with the accumulation of UV-protective flavonoid 
compounds (7); in this species, fungal elicitation does not 
involve ACCase induction (42) as the parsley furanocoumann 
phytoalexins do not require malonyl CoA for their synthesis. 
It is not yet known whether elicitation and the concomitant 
induction of ACCase has quantitive or qualitative effects on 
fatty acid elongation. The present work opens up the possi- 
bility of investigating regulation of channeling of malonyl 
CoA into primary and secondary metabolism by genetic 
manipulation of ACCase levels. 

We thank Karen Dalkin for construction of the alfalfa cDNA 
library, Nancy Paiva for providing some of the elicited cell suspen- 
sion cultures, Gansan Gown for assistance in constructing the alfalfa 
genomic library, and Yves Poirier and Dave Shintam for critical 
reading of the manuscript. 

1. Post-Beittenmiller,D.,Jaworski,J. G. &Ohlrogge, J. B.(1991) 
J. Biol. Chem. 266, 1858-1865. 

2. Post-Beittenmiller, D., Roughan, G. & Ohlrogge, J. B. (1992) 
Plant Physiol. 100, 923-930. , 

3. Harwood, J. L. (1988) Annu. Rev. Plant Physiol. Plant Mol. 

4. E, Kl-H.Sz-Casillas, F. & Bai. D.-H. (1989) FASEB J. 
3, 2250-2256. tj t tl , c 

5. Egli, M. A., Gengenbach, B. G., Gonwald, J. W., Somers, 
D A. & Wyse, D. L. (1993) Plant Physiol. 101, 499-506. 

6. Pollard, M. & Stumpf, P. (1980) Plant Physiol. 66, 649-655. 
7 Ebel, J. & Hahlbrock, K. (1977) Eur. J. Biochem. 75, 201-209. 



Proc. Natl Acad. ScL USA 91 (1994) 4327 

8. Ebel, J., Schmidt, W. E. & Loyal, R. (1984) Arch. Biochem. 
Biophys. 232, 240-248. 

9. Dixon, R. A. & Harrison. M. J. (1990) Adv. Genet. 28, 165- 
234 

10. Lanz, T., Tropf, S., Marner, F.-J., Schroder, J. & Shroder, G. 
(1991) J. Biol. Chem. 266, 9971-9976. 

11. Kionka, C. & Amrhein, N. (1984) Planta 162, 226-235. 

12. Li, S.-J. & Cronan, Jr., J. E. (1992) /. BioL Chem. 267, 
855-863. , 

13. Li, S.-J. & Cronan, Jr., J. E. (1992) J. Biol. Chem. 267, 
16841-16847. ^ w . _ 

14. Kondo, H., Shiratsuchi, K., Yoshimoto, T., Masuda, T., 
Kitazono, A., Tsuru, D., Anai, M., Sekiguchi, M. & Tanabe, 
T (1991) Proc. Natl. Acad. Sci. USA 88, 9730-9733. 

15. Alix, J.-H. (1989) DNA 8, 779-789. 

16 Takai, T., Yokoyama, C, Wada, K. & Tanabe, T. (1988) J. 

BioL Chem. 263, 2651-2657. 
17. Lopez-Casillas, F., Bai, D.-H., Luo, X., Kong, L-S., Hermod- 

son, M. A. & Kim, K.-H. (1988) Proc. Natl. Acad. Set. USA 

85 5784 5788 

18 Al-Feel,"w., Chirala, S. S. & Wakil, S. J. (1992) Proc. Natl. 

Acad. Sci. USA 89, 4534-4538. 
19. Roessler, P. G. & Ohlrogge, J. B. (1993) 7. Biol. Chem. 268, 

19254-19259. 

20 Gornicki, P., Scappino, L. A. & Haselkorn, R. (1993) J. 
BacterioL 175, 5268-5272. 

21. Nikolau, B. J. & Hawke, J. C. (1984) Arch. Biochem. Biophys. 
228 86 96 

22. Kannangara, C. G. & Stumpf, P. K. (1972) Arch. Biochem. 
Biophys. 152, 83-91. 

23. Sasaki, Y., Hakamada, K., Suama, Y., Nagano, Y., Furusawa, 
I. & Matsuno, R. (1993) J. BioL Chem. 268, 25118-25123 

24. Dalkin, K., Edwards, R., Edington, B. & Dixon, R. A. (1990) 
Plant Physiol. 92, 440-446. 

25 Schumacher, H. M., Gundlach, H., Fiedlar, F. & Zenk, M. H. 
(1987) Plant Cell Rep. 6, 410-413. 

26 Bradford, M. M. (1976) Anal. Biochem. 72, 248-257. 

27 Shorrosh, B. S. & Dixon, R. A. (1991) Proc. Natl. Acad. Set. 
USA 88, 10941-10945. 

28. Millar, S. J., Dempsey, D. L. & Dickinson, P. D. (1992) Bio- 
Techniques 13, 554-562. 

29 Sanger, F., Nicklen, S. & Coulson, A. R. (1977) Proc. Natl. 
Acad. ScL USA 83, 8073-8077. 

30. Sanchez-Pescador. R. & Urdea, M. S. (1984) DNA 3, 339-343. 

31. Chomczynski, P. & Sacchi, N. (1987) Anal. Biochem. 162, 

32. Shorrosh, B. S. & Dixon, R. A. (1992) Plant /. 2, 51—58. 

33. Shorrosh, B. S. & Dixon, R. A. (1992) Plant Mol. BioL 18, 
1189—1190 

34 Maniatis, T., Fritsch, E. F. & Sambrook, J. (1982) Molecular 
Cloning: A Laboratory Manual (Cold Spring Harbor Lab. 
Press, Plainview, NY). 

35. Kozak, M. (1987) Nucleic Acids Res. 15, 8125-8132. 

36. Samols, D., Thornton, C. G., Murtif, V. L.. Kumar, G. K., 
Haase, F. C. & Wood, H. T. (1988) J. BioL Chem. 263, 
6461-6464 

37. Sternberg,' M. J. E. & Taylor, W. R. (1984) FEBS Leu. 175, 
387-392 

38. Moller, W. & Amons, R. (1985) FEBS Lett. 186, 1-7. 

39 Konodo, H., Shiratsuchi, K., Yoshimoto, T., Masuda, T., 
Kitazono, A., Tsuru, D., Anai, M„ Sekiguchi, M. & Tanabe. 
T (1991) Proc. Natl. Acad. Sci. USA 88, 9730-9733. 

40. Liu, Y., Hoffman, N. E. & Yang, S. F. (1983) Planta 158, 
437-441 

41. Ohlrogge, J. B., Pollard, M. R. & Stumpf, P. K. (1978) Lipid 

42 Howa^d, 2 J 10 L. & Ridley, S. M. (1990) FEBS Lett. 261, 261-264. 
43*. Roesler, K. R., Shorrosh, B. S. & Ohlrogge, J. B. (1994) Plant 
Physiol., in press. 



Plant Molecular Biology 21: 267-277, 1993. 

© 1993 Kluwer Academic Publishers. Printed in Belgium. 



Claim 67 

ACP-acyltransf erase 



267 

V 



The gene and the RNA for the precursor to the plastid-located 
glycerol-3-phosphate acyltransferase of Arabidopsis thaliana 

Ikuo Nishida, Yasushi Tasaka, Hideaki Shiraishi and Norio Murata 
National Institute for Basic Biology, Myodaiji, Okazaki 444, Japan 

Received 26 May 1992; accepted in revised form 18 September 1992 

Key words: Arabidopsis, chilling sensitivity, glycerol-3-phosphate acyltransferase, glycerolipid 
biosynthesis, RNA 

Abstract 

The gene and the RNA from Arabidopsis thaliana for the plastid-located glycerol-3-phosphate acyltrans- 
ferase (GPAT; EC 2.3.1.15) and their encoded product have been studied. The gene (designated ATS1) 
was isolated by screening a ADASH genomic library for cross-hybridization with a radiolabeled probe 
prepared from cDNA for GPAT from squash. cDNA clones representing the mRNA were isolated by 
screening a AZAPII cDNA library for hybridization with a radiolabeled probe prepared from a DNA 
fragment of ATS 1. The nucleotide sequences of the gene and the cDNA were determined, and the 5' 
end of the RNA was mapped by primer extension. Sequences similar to the TATA box, polyadenylation 
sequences and intron-splicing sequences were found at the expected locations. The pre-mRNA was 3288 
nucleotides long and contained 5' and 3 '-untranslated sequences of 57 and 442 nucleotides, respectively. 
The coding sequence of 1377 nucleotides was interrupted by 11 introns of 1412 nucleotides in total and 
the 3 '-untranslated sequence contained another intron of 94 nucleotides. The open-reading frame en- 
coded a polypeptide of 459 amino acid residues, the amino acid sequence of which was highly homol- 
ogous to those of precursors to plastid-located GPATs from squash and pea. The enzymatic activity 
of a gene product that was over-produced in Escherichia coli confirmed the identity of the gene. 

Abbreviations: ACP, acyl carrier protein; GPAT, glycerol-3-phosphate acyltransferase; IPTG, isopropyl- 
jS-thiogalactopyranoside. 



Introduction 

In all organisms higher than the eubacteria, 
glycerol-3-phosphate acyltransferase (GPAT; EC 



2.3.1.15) catalyzes the first reaction in the syn- 
thesis of glycerolipids de novo, namely, synthesis 
of 1-acylglycerol 3-phosphate (lysophosphatidic 
acid) from glycerol 3-phosphate and acylth- 



The nucleotide sequences reported herein for the gene and the mRNA will appear in the EMBL, GenBank and DDBJ Nucle- 
otide Sequence Databases under the accession numbers D00672 and D00673, respectively. The designation of the gene, ATS1, 
has been changed from atsA, which was used in our preliminary report [22] of this work, in accordance with the recommended 
nomenclature for genes in Arabidopsis. 
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ioesters. Therefore, this enzyme is indispensable 
for the biosynthesis of membrane lipids [18]. In 
cells of higher plants at least three types of GPAT, 
which differ in their subcellular location, contrib- 
ute to biosynthesis of glycerolipids. They also dif- 
fer from one another in terms of their substrate 
specificity and binding to membranes: GPAT 
bound to membranes of the endoplasmic reticu- 
lum is specific for acyl-CoA [3]; plastid-located 
GPAT is soluble in the stroma and uses acyl- 
(acyl-carrier protein) (acyl-ACP) as substrate [4]; 
and mitochondrial GPATs appear to vary in their 
binding to membranes among plant species and 
they react with both acyl-CoA and acyl-ACP [5]. 
Molecular cloning of genes for plant GPATs is 
important if we are to understand the regulation 
of biosynthesis of glycerolipids in plant cells. 

The plastid is the sole site of biosynthesis of 
fatty acids in plant cells [31]. The final product 
of the synthesis in the plastids is acyl-ACP [32]. 
The acyl group of acyl-ACP is either directly in- 
corporated into glycerolipids within plastids or, 
after hydrolysis, it is exported from plastids for 
biosynthesis of glycerolipids in the endoplasmic 
reticulum and mitochondria [28]. Plastid-located 
GPAT competes for acyl-ACP with another stro- 
mal enzyme, acyl-ACP hydrolase [23], which hy- 
drolyzes the acyl-ACP to fatty acids and ACP. 
Thus, the relative activities of the transferase and 
the hydrolase have been postulated to play a reg- 
ulatory role in determining the fate of fatty acids 
in the biosynthesis of glycerolipids in plant cells 
[ 15, 23]. Another interesting aspect of the plastid- 
located GPAT is its unique specificity toward 
substrates. We have demonstrated that the fatty- 
acid selectivity of the GPAT is a major determi- 
nant both of fatty-acid unsaturation of phosphati- 
dylglycerol from plastidial membranes and of the 
chilling sensitivity of plants [19, 20]. 

We have found three isomeric forms of plastid- 
located GPAT in squash cotyledons and have 
purified two of the isoforms to homogeneity [21]. 
A cDNA clone for one of the purified isoforms 
has been obtained by screening a squash cDNA 
library for immuno-cross-reactivity. The nucle- 
otide sequence of the cDNA has been determined 
[9]. 



Arabidopsis rhaliana provides a simple system 
for molecular cloning of plant genes [25]. The 
genetic amiK^is of mutants defective in plastid- 
located GP,\T activity suggests that the GPAT 
gene is a sui\gle-copy gene in the Arabidopsis ge- 
nome [13], This plant system seems to be very 
well suited tor studies of the sequences of the 
gene, the R\.\ and the protein responsible for 
plastid-located GPAT activity. We report here 
the cloning of the gene and the cDN A for a pre- 
cursor to GV AT fromy4. thaliana (the gene is des- 
ignated A2^$j\ and we discuss features of the 
gene, the R\ \ and the encoded amino acid se- 
quence. The identity of the gene is confirmed by 
the extent of sequence homology with GPAT from 
squash [9] - U id pea [33] and by the enzymatic 
activity of ^ gene product that is over-produced 
in E. coli. 



Materials and methods 

Plant materials and bacterial strains 

The LandsNerg erecta strain of A. thaliana was 
obtained fi\\ m Dr A.R. Kranz (Botaniches Insti- 
tut, J.W. Cvoethe-Universitat, Frankfurt am 
Main, FRGY Arabidopsis seeds were germinated 
on peat places irrigated with 1/1000-strength Hy- 
ponex solution (Hyponex, Miami, FL, USA), and 
the plates \\ CTe incubated for 3-4 weeks at 25 °C 
under continuous fluorescent illumination (20 /iE 
m" 2 s" 1 ). Leaves were used in further experi- 
ments. E. oo// BL21 (DE3) cells [30] and the 
translation plasmid pET-3c [27] were generous 
gifts from IV F.W. Studier (Biology Department, 
Brookhavec National Laboratory, NY, USA) 
and were chained through Professor Y. Shimura 
(Department of Biophysics, Kyoto University). 

General methods for manipulation of nucleic acids 

Extraction of nucleic acids from A, thaliana, 
DNA/DN A and RNA/DNA hybridization anal- 
ysis, and o* her techniques for manipulation of 
nucleic acios were performed by the procedures 
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described by Ausubel etal. [1]. Poly(A) + RNA 
was purified by adsorption to Oligotex dT30 resin 
(Nippon Roche, Tokyo). Radiolabeled probes 
were prepared by nick translation [26] using 
[cc- 32 P]dCTP (lllTBq/mmol; DuPont/NEN 
Research Products, Wilmington, DE, USA). Ex- 
cision and recircularization of pBluescript (sk~) 
plasmids that contained cloned cDNA inserts 
were carried out in situ using the helper phage 
R408 (Stratagene, La Jolla, CA, USA) in accor- 
dance with the manufacturer's protocol. 



Construction of recombinant libraries 

Construction of Arabidopsis genomic libraries fol- 
lowed the procedures described by Ausubel et al 
[1]. Partially S^AI-digested fragments of Ara- 
bidopsis DNA were fractionated by centrifugation 
on a sucrose gradient and fragments of about 
20 kb in length were ligated with Bam Hi-digested 
arms of a phage vector, ADASH (Stratagene). An 
Arabidopsis cDNA library was constructed by the 
manufacturer's protocol which was supplied with 
the cDNA synthesis kit (Pharmacia P-L Bio- 
chemicals, Uppsala, Sweden). cDNA was syn- 
thesized by priming with oligo(dT) 12 _ 18 primers 
and ligated with Eco Rl-digested arms of another 
phage vector, AZAPII (Stratagene) through 
EcoRl/Notl linkers. DNA coricatemers were 
packaged into phage particles with the packaging 
extract, Gigapack II Gold (Stratagene). The ge- 
nomic libraries were amplified but the cDNA li- 
brary was not amplified. 



Screening of recombinant libraries 

Phages were plated on agarose plates 
(10 cm x 14 cm) at a density of less than 20000 
plaque-forming units per plate. Plaques were lifted 
onto nylon membranes (GeneScreen Plus; Du- 
Pont, Boston, MA, USA), and the membranes 
were then rinsed in 5 x SSC(lx SSC is 0.15 M 
NaCl, 0.015 M sodium citrate pH 7.0, 0.5% 
SDS, 1 mM EDTA) at 42 °C for 1 h. Hybridiza- 
tion was preceded by prehybridization in the ab- 



sence of radiolabeled probes at 65 °C for 1 h. 
Hybridization cocktails contained 6 x SSC, 5 x 
Denhardt's solution (1 x Denhardt's solution is 
0.02% Ficoll, 0.02% polyvinylpyrrolidone, 
0.02% bovine serum albumin), 10% dextran sul- 
fate, 0.1% SDS and 100/ig/ml salmon testis 
DNA. A squash cDNA probe was prepared from 
the insert DNA of a cDNA clone for squash 
GPAT, AT03 [9]; an Arabidopsis DNA probe 
was prepared from the 2.6 kb Bam HI restriction 
fragment of an isolated genomic clone, A6; and an 
Arabidopsis cDNA probe was prepared from the 
1.3 kb Sail-Sty I restriction fragment of an iso- 
lated cDNA clone, AR1 (Fig. IB). Cross-hybrid- 
ization with the squash cDNA probe was carried 
out at a probe concentration of 10 5 cpm/ml and 
at 50 °C with washes in a washing solution that 
contained 2x SSC and 0.1% SDS at 40 °C, 
whereas hybridization with either the Arabidopsis 
DNA probe or the Arabidopsis cDNA probe was 
carried out at a probe concentration of 
5 x 10 5 cpm/ml, at 65 ° C, with washes in the same 
washing solution at 65 °C. 



Determination of DNA sequences 

DNA sequences were determined by the dideoxy 
chain-termination method [29] with deoxy-7- 
deazaguanosine triphosphate used in place of 
dGTP [17]. Oligonucleotide primers were syn- 
thesized with a DNA synthesizer (model 391; 
Applied Biosystems Japan, Osaka). 

Over-production of a GPAT fusion protein in E. coli 

A 1.2 kb Hga \-Eco RI restriction fragment of an 
isolated cDNA clone for GPAT from Arabidopsis 
(AR1 in Fig. IB) was blunt-ended and subcloned 
into the Bam HI site of pET-3c to obtain the 
plasmid pET-ARl. In this way, a new open read- 
ing frame, encoding a fusion protein between a T7 
bacteriophage leader sequence of 12 amino acid 
residues, which included the junction sequence, 
and the sequence encoding GPAT from Arabi- 
dopsis of 369 amino acid residues, was con- 
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Fz£. 1. Structures of the gene (ATS1) and the mRNA for 
plastid-located glycerol-3-phosphate acyltransferase from 
A. thaliana. A. Restriction maps of isolated genomic clones. 
The nucleotide sequence of a DNA segment, indicated by a 
bar with arrows at both ends, was determined with nested 
deletions [1] of XB3.6 and BB2.6. Restriction enzymes in 
parentheses indicate that the restriction sites are located within 
the linker sequences of the vector arms. B. Structures pre- 
dicted for the pre-mRNA (top) and the mRNA (middle) are 
compared with those determined for isolated cDNA inserts 
(bottom). Exons and introns are shown by open and closed 
boxes, respectively. The open reading frame is designated as 
ORF and the direction of translation is indicated by the ar- 
rowhead. Nucleotide sequences of cDNA inserts, AR1 and 
AR2, were determined by use of synthetic oligonucleotide 
primers. The 5' end of the mRNA was estimated by primer 
extension (Fig. 3) and the number 1 was assigned to the first 
nucleotide of the mapped 5' end. The cDNA inserts AR1 and 
AR2 start from nucleotides 43 and 64, respectively. 



structed under the control of a T7 bacteriophage 
promoter. Over-production of the GPAT fusion 
protein in £. coli BL21 (DE3), after transforma- 
tion with pET-ARl, was achieved by following 
the procedures described by Hoey et al [8]. Cells 
were suspended in 20 ml of a solution that con- 
tained 45 mM Tris-HCl pH 7.4, 2 mM DTT, 
10 mM sodium ascorbate, 10% (v/v) glycerol, 
1 mM benzamidine hydrochloride, 5 mM 6-ami- 
nohexanoic acid and 0.024 mM leupeptin, and 
were broken by passage through a chilled French 
pressure cell (FA-030; SLM Instruments, Ur- 
ban a, IL, USA) operated at 57 MPa. The homo- 
genate was clarified by centrifugation at 16 000 x g 
for 10 min and then at 100000 x g for 60 min to 
obtain the water-soluble protein extract. 



Assay of GPA T activity and analysis of proteins 

GPAT activity was measured as described pre- 
viously [21]. SDS-PAGE was carried out as de- 
scribed by Laemmli [14]. 



Results 

Isolation of a gene from Arabidopsis for a precur- 
sor to plastid-located GPAT 

An Arabidopsis genomic library was screened for 
cross-hybridization with the squash cDNA probe. 
From 18000 recombinant clones, two positive 
clones were obtained and designated 16 and A 10. 
These positive clones contained a 2.6 kb Bam HI 
restriction fragment of Arabidopsis DNA which 
hybridized with the squash cDNA probe. How- 
ever, when partial DNA sequences were deter- 
mined, they revealed that the positive clones were 
not full-length clones. To obtain segments of 
DNA that included the entire length of the gene, 



Fig. 2. Nucleotide sequences of the ATS1 gene and the 5' and 3' flanking sequences. The deduced amino acid sequence is shown 
in the one-letter code and numbered separately. Exon sequences are shown in brackets, A TATA-like sequence and a putative 
polyadenylation signal are indicated by single and double underlinings, respectively. 



1 TATTCATTATGTGTACACACAAACATGACTACTTCAA^ 

9 1 AAAGATGTGAGGTTATAATTTGGTCTTGAGTCTTCTITACTCACAAA 

181 TATC ATTATAATTATG AGTTCTT C AATGAAATAAGTTTACT 

271 AAAAATAAAAAAGAAAAACGTGAAAAAGAGGGACTCATGGGAJ_AMTAAAAAGGA^ 

361 CACACGCTCTGCTTCGACCAATCACCAAACACGCTTTAATGACT^ 

r MTLTFSSSAATVAVAAA' 

451 CCGTAACCTCCTCCGCTAGGGTTCCGGTTTATCCACTCGCTTCGTCGACTCTTCGTGGATTA 
18'TVTS SARVPVYPLASS TLRG LVSFRLTAKK 

541 TGTTTCTGCCGCCTCTTCGTTCTCGCGGCGGCGTTAGTCT 
48'LFLPPLRSRGGVSVRAMSELVQDKESSVAA 

631 GCATTGCTTTCAATGAAGCCGCCGGTGAGACGCCGAGT^ 
78'SIAFNEAAGE TPSELNHSRTFLD ARSEQj 

721 tactcttcttcgtcttctrrcctgaggaatttgaattga 

811 tcctttgtgttcataatcttatcgcgaaatcgatat^ 

901 tttgcttck^tcttgctccattcttgttaaatggataa^ 

991 ctttmgaaagatcagttcataaggatgcaatcagataatttc 

1081 tgcttgtattaacattgtgtttatcitcat^ 

106 * pLLSGIKKEAEAGRLP 

1171 o^tgttgcagcaggaatggaagaattgtattggaact^ 

122' ANVAAGMEELYWNYKNAJ 

1261 agtgctatgctttggaattaattagattaaagaacatcagtc^ 

139' ' t L 

1351 tagtggagcttccagggcagatgaaactgttgtatcaaac^ 

142' SGASRADETVVSNMSVAFDR MLLG.VEJ 

1441 GTCAATCTCTTGGAATGATTTTTCTACTACX3^ 

168' L° 

1531 CTTATACTTTTAATCCATATCATAAAGCAGTC^ 

169' PYTFN PYHKAVREPFDYYMFVHTYIRPLID 

1621 tcaaTstaagctggaatatgcactcatagttat^ 

199' F k| 

1711 gatcttccagttttcttcttgccatcaagtgtt^ 

201 ' J^NSYVGNAS 

1801 tatattctctgagctc&aagacaagattc 

209' IFSELEDKIRQj 

1891 aaagttctttccattcaggattttaatactcaactgtttaattacag^ 

220' [GHNIVLISNHQSEA 

1981 atccggctgtcatttctctattgcttgaagcacaatctcctttcataggaga^ 

234' DPAV ISLLLEA QSPFIGENlJ 

2071 TACGCTATTCTCTCTCTTCTTATTTGTTTCTTTGGTTG 
254 ' |jCCVAGDRVITDPLCKP 

2161 TCAGTATGGGAAGjjTATCACGAGCTTTCATTTGCAATGGT^ 

270' F S M G g 

2251 ggatttctactgctttcagEaacctcatatgtgtttactcgaaaaagcacatgaa 

275- [nlicvyskkhmnvdpelvdmkrk 

2341 CAAACACACGAAGCTTAAAGGAGATGGCTACAATGCTAAG^^ 

298' ANTRSLKE'MATMLRj 

2431 AATTAAAGGAAAAC^TTGATGTTTAGTCTGCTAATGTT^^ 
312 , |^SGGQLIWIAPSG 

2521 G AAGGG AC CG C CCG AAT C CTTCT ACTGGGGAATGGTTTCC^T AAGTT ATTTGATGGAAT ACAGAG AT CTT ATCTGAGTCTGGTAGATAT 

324' GRDRPNPSTGEWFPj 

2611 GCAAGTACTAATCTAGTAATGTTCAGjGCACCCm 
338 ^ [APFDASSVDNMRRLVEHSGAP 

2701 GACATATATATCCAATGTCTTTGCTTTGCTA 

359' GHIYPMSLLCYDIMPPPPQJ 

2791 CCGTTTTGGGCTAACTCTGGTTGACTTTCCTCTCATCATCACrrCAT^ 
37Q , I^VEKEIGEKRL 

2881 AGTTGGGTTTCACGGTACTGGACTATCAATTGCTCCTGAAATCAACTTCTCA 
388' VGFHGTGL'SIAPE INFSDVTADCESPN Ej 

2971 TCTGTAACACTCAGCACACCAATGACTTTAGTTTCAAGATCGAGTAAT^ 

3061 AGAAGCATACAGCCMGCTTTGTACAAGTCGGTGAATGAACAATACGAGATCTTAAACT 

418' EAYSQALYKSVNEQYEI LNSAIKHRRGVEA 

3151 ATCAACTTCAAGGGTCTCTTTGTCACAACCTTGGAATTAGTCTCT 

448' STSRVSLSQPWN* 459' J 

3241 AAGTTTTGTTGCAACT GTATATATATTC AGAGAG AGAG CATTGTTCTTT CATTTG C AG^AT ACACAAACACAAT CAATGGAAAAT ACTC A 

3331 AAAAAAAAAATGGAGATATTATTCACACGTTGTCCTAGGGAGTlTri T T'l''I T T T TTCAAACGAGTTCCAAAGCCTCCTTGAAACAACAAG 

3421 AAAAAAAAAAACAGGTACGTTGTTGGTTTCTAACGTAGGAAGAAGAAGAAGTTGGTT^ 

3511 CTTGCTAGGCTCTTAGATCT<^<^TATGGm 

3601 ACAATCCCAATTTATCCCTCTTCCTTCT^ 

3691 GCTTTGATATATCAGTCAGTMCACAATTTTGCGGAAAATGGGTTTTGTT 

3781 CT CTTAT AATTTTG CAGAAAAAAAT CTGAG AAAC AATTCT AC C^TTAGTCAAAAGTGTTT ACGAAAT AAAAGAGTTATTGGT CAAAACAT 

3871 TCCATTTCACTATTGAAAACMTTTTGriTGTGATCTCTA^ 

3961 TACTTGGC7^GTTGA(^GAGATGATCAGTGTCTGGCCTGTTTGGATCC 4008 
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another library was screened for hybridization 
with an Arabidopsis DNA probe that had been 
prepared from the 2.6 kbp Bam HI restriction 
fragment of A6. After screening of 18000 recom- 
binant clones, five positive clones were obtained. 
Their restriction digests revealed that they were 
overlapping, and all contained the 2.6 kb Bam HI 
restriction fragment. One of the clones, which 
contained a 13.6 kb DNA insert (A20 in Fig. 1A), 
was analyzed further. A20 was digested with 
Bam HI and Xba I, and 2.6 kb and 3.6 kb restric- 
tion fragments were obtained that were designated 
BB2.6 and XB3.6, respectively. The nucleotide 
sequences of BB2.6 and a 3' region of XB3.6 
were determined (Fig. 1A). The sequence of ge- 
nomic DNA, containing 4008 bp in total, is pre- 
sented in Fig. 2. The gene for GPAT was desig- 
nated ATS I (for acyltransferase 1). Blots of some 
restriction digests of the Arabidopsis genome gen- 
erated by combinations of Eco RI and Eco RV 
and also of Sal I and Sty I were probed with an 
Arabidopsis cDNA probe prepared as described 
above (data not shown). The numbers and sizes 
of the restriction fragments were those expected 
from the sequence of the genomic DNA, suggest- 
ing that the sequence of the gene represents the 
structure within the genome. 



Isolation of Arabidopsis cDNA clones for a precur- 
sor to plastid-located GPAT 

The Arabidopsis cDNA library was screened for 
hybridization with an Arabidopsis DNA probe 
prepared from the BB2.6 fragment of the genomic 
clone. After screening of 110000 recombinant 
clones, one positive clone designated AR1 was 
obtained (Fig. IB). Another set of 1 10000 recom- 
binant clones from the cDNA library was 
screened again for hybridization with an Arabi- 
dopsis cDNA probe prepared from the 1.3 kb 
Sail-Sty \ restriction fragment of AR1, and a 
positive clone designated AR2 was obtained 
(Fig. IB). Sequence determination revealed that 
the cDNA inserts of the AR1 and AR2 clones 
were 1445 bp and 1725 bp in length, respectively, 
and they contained an overlapping sequence of 



1424 bp (Fig. IB). Consequently, we determined 
a cDNA sequence of 1746 bp which included a 
poly(A) sequence of 5 bp. An RNA blot of Ara- 
bidopsis poly(A) + RNA was probed with the Ar- 
abidopsis cDNA probe and a cross-reactive band 
of 1.9 x 10 3 nucleotides in length was detected 
(Fig. 3). The primer extension of the RNA re- 
sulted in an extension product that extended 57 
nucleotides beyond the first ATG codon (Fig. 4). 
This result suggests that the mRNA consists of 
1783 nucleotides plus a poly(A) tail of ca. 120 
nucleotides. Neither ATG codons nor possible 
intron sequences were found between the mapped 
5' end and the first ATG codon, indicating that 
the cloned cDNA contains a full-length open- 
reading frame of 1377 bp. The nucleotide se- 
quence flanking the putative initiation codon, 5'- 
GCTTTAATGACT-3 ' , indicates that the 
reported consensus sequence for initiation of 
translation in plants, 5 ' -TAA AC AATG GCT 
[10], was not included in the present case. 



Deduced amino-acid sequence for a precursor to 
plastid-located GPAT from A. thaliana 

The open-reading frame encodes a polypeptide of 
459 amino acids. The amino-terminal leader se- 
quence from 1 to 65 amino acid residues is rich 

1 2 



1.9 kb-*- 



Fig. 3. RNA/DNA hybridization analysis of RNA from Ar- 
abidopsis. The RNA blot was probed with an Arabidopsis 
cDNA probe prepared from the 1.3 kb Sail-Sty I restriction 
fragment of AR1. The size of the hybridized band was cali- 
brated with RNA molecular weight marker I (Boehringer 
Mannheim, Germany) consisting of RNA markers 7.4, 5.3, 
2.8, 1.9, 1.6, 1.0, 0.6, 0.4 and 0.3 kb long. 1, 5jxg poly(A) + 
RNA; 2, 30 /ig total RNA. 
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Fig. 4. Analysis of the 5' end of the RNA by primer extension. 
The primer extension reaction was performed using a proce- 
dure for RNA sequencing [6] and the products of transcrip- 
tion were analyzed on an 8% denaturing polyacrylamide gel. 
In parallel, DNA sequencing reactions were performed and 
products were used as size markers. For both reactions, the 
same oligonucleotide primer, namely, 5'-CGTGAGAGT- 
CATTAAAG-3', was used. Poly(A) + RNA (10 //g) and 
pXB3.6 were used as templates for the primer extension re- 
action and the sequencing reaction, respectively. The position 
of the detected transcript is shown by an arrow. 

in hydroxy amino acids, serine and threonine, and 
small hydrophobic amino acids, alanine and va- 
line, which together are indicative of a transit pep- 
tide [ 12]. The site for removal by cleavage of the 
leader sequence was not determined in the present 
study (see Discussion). The deduced amino-acid 
sequence of the polypeptide from Arabidopsis ex- 
hibited 68.8% and 60.4% homology to those of 
the plastid-located GPAT from squash (368 
amino acids) [9] and a precursor to the plastid- 
located GPAT from pea (457 amino acids) [33]. 



Over-production of a GPAT fusion protein in E. coli 

Figure 5 shows the results of SDS-PAGE of pro- 
teins from BL21 (DE3) cells transformed with 
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Fig. 5. The over-expression of glycerol- 3-phosphate acyltrans- 
ferase in E. coli. The proteins from E. coli cells transformed 
with pET-ARl were analyzed by SDS-polyacrylamide gel 
electrophoresis in a 12.5% polyacrylamide gel. The 42-kDa 
protein is indicated by an arrow. 1, molecular mass markers; 
2, total proteins from cells before addition of isopropyl-/?- 
thiogalactopyranoside (IPTG); 3, total proteins from cells 3 h 
after addition of IPTG; 4, a water-soluble protein fraction 
from cells 3 h after addition of IPTG. 



pET-ARl. A 42kDa polypeptide was synthe- 
sized upon addition to the bacterial culture of 
isopropyl-/?-thiogalactopyranoside (IPTG; Fig. 5, 
lane 3). The relative molecular mass of the pro- 
tein was of the size anticipated for the fusion 
protein encoded by pET-ARl (M T 42422). This 
polypeptide accounted for as much as 30% of the 
total water-soluble proteins after a further 3 h in- 
cubation (Fig. 5, lane 4). A substantial level of 
GPAT activity was detected in the soluble protein 
extracted from cells transformed with pET-ARl 
but not from cells transformed with the vector 
plasmid, pET-3c, and the specific activity of 
GPAT was increased 6-fold after a 3 h incubation 
with IPTG (Table 1). These results suggest 
that the enzymatic activity is attributable to the 
water-soluble polypeptide and not to the mem- 
brane-bound glycerol-3-phosphate acyltrans- 
ferases from E. coli [16]. Hence, we conclude 
that the ATS1 gene is a structural gene that en- 
codes a precursor to the plastid-located GPAT of 
A. thaliana. 
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Table 1. Over-expression of GPAT activity in soluble protein 
extracts from E. coli BL21 DE3). 



Transform ant 


Activity 
(nmol/min) 


Protein 
(mg) 


Specific 
activity 
(nmol/min 
per mg 
protein) 


BL21 (DE3)/pET-ARl 








-IPTG 


1600 


19 


84 


+ IPTG, then 3 h 


14000 


30 


470 


BL21 (DE3)/pET-3c 








-IPTG 


0 


18 


0 


+ IPTG, then 3 h 


0 


21 


0 



Cells were grown in 500 ml of broth until the optical density 
at 600 nm reached 0.5. Then half of the culture was kept for 
control measurements ( - IPTG), and the other half was sup- 
plemented with 0.4 mM IPTG and incubated for another 3 h 
( + IPTG, then 3 h). Water-soluble protein extracts were pre- 
pared and GPAT activity and protein contents were measured 
as described in the text. 

Discussion 

Alignment between the genomic DNA and cDNA 
sequences revealed that the genomic DNA se- 
quence contained 11 intron sequences of 1412 bp 
in total, each interrupting the nucleotide sequence 
of the open reading frame, and another intron 
sequence of 94 bp in the 3 ' untranslated sequence. 
All the intron sequences contained invariant GT 
and AG dinucleotides at the 5' and 3' splice sites, 
respectively. Sequences for intron-splicing sites 
were consistent with the consensus sequences for 
plant intron s [7]. 

Sequences characteristic of expressed eukary- 
otic genes are found in the^irS7 gene. However, 
the function of these sequences in the expression 
of the AT SI gene remains to be examined exper- 
imentally. No TATA box consensus sequences 
[ 10] are found within the expected distance, i.e., 
32 + 7 nucleotides upstream, from the mapped 5' 
end of the RNA. A TATA-like sequence at po- 
sition -29, 5 ' -GGG ATAAATAAAA-3 ' , may 
play the role of a TATA box. The absence of a 
typical TATA sequence in housekeeping genes 
has occasionally been observed in animal and 
plant genes [2, 10]. The putative polyadenylation 



signal, 5'-AATAAT-3', similar to the plant con- 
sensus, 5'-AATAAA-3' [11], is located 37 nu- 
cleotides upstream of the poly(A) site, almost 
within the expected distance (27 + 9 nucleotides) 
[11]. 

In the present study, the 5' and 3' flanking 
sequences of the ATS 1 gene were also determined. 
However, an experimental examination of the 
possible significance of these sequences in the 
expression and regulation of the ATS1 gene re- 
mains to be performed. A homology search indi- 
cated that a unique sequence, 5'-ATTCATTAT- 
3', is present in the 5' untranslated sequences of 
the ATS 1 gene and of one of the ACP genes from 
Arabidopsis [24], However, the significance of this 
sequence in gene expression is unclear. The 3'- 
untranslated sequence contained an intron se- 
quence, but it is not known whether the splicing 
of this intron serves any special function in the 
maturation and subsequent translation of the 
mRNA. The 3 '-untranslated sequence of the 
mRNA contains two poly(A) tracts, one of which 
appears to have been annealed with oligo(dT) 
primers to cause priming for synthesis of cDNA, 
with resultant isolation of the AR1 clone. Finally, 
it should be noted that a one-base insertion into 
the cDNA sequence occurred in the 3 '-untrans- 
lated region of the AR2 cDNA clone; the deter- 
mined cDNA sequence contained a (T) 15 tract in 
the 3 '-untranslated sequence, whereas the ge- 
nomic DNA sequence from nucleotide number 
3373 to 3386 contained a (T) 14 tract. 

Figure 6 shows the alignment of amino-acid 
sequences of precursors to the plastid-located 
GPATs from Arabidopsis, pea [33] and squash 
[9]. Amino-acid sequences are conserved at least 
7 regions (underlined in Fig. 6). The significance 
of these conserved sequences in the enzymatic 
activity and the interaction with substrates re- 
mains to be examined. In contrast to the con- 
served sequences, the amino-acid sequence from 
1 to 90 of the precursor from Arabidopsis shows 
only 16.6% and 28.8% homology to the corre- 
sponding pre-sequences of the precursors from 
pea and squash, respectively. In particular, the 
amino-acid sequence from 61 to 90 of the precur- 
sor from Arabidopsis greatly differs from the cor- 
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Fig. 6. The alignment of amino-acid sequences deduced from open-reading frames of cDNAs for precursors to plastid-located 
GPATs from Arabidopsis, pea [33] and squash [9]. Amino acids are given in one letter code, and gaps indicated by ( - ) are in- 
serted to maximize similarity among sequences. Identical amino acids are shown in shadow boxes. A, P and S on the right col- 
umn stand for Arabidopsis, pea and squash, respectively. Each amino-acid sequence is numbered separately on the left column. 
The deduced amino-acid sequence for a precursor to the GPAT from squash is truncated at the amino terminus [22, 33]. 
Ammo-terminal residues of purified GAPTs from squash [9] and pea [33] are indicated by an arrow. 



responding sequence of the precursor from pea. 
Interestingly, however, the sequence of the pre- 
cursor from Arabidopsis is similar to the aligned 
sequence of the precursor from squash. The 
amino-terminal residues of purified GPATs from 
squash [21] is 70 E. The corresponding amino- 
terminal residue of mature GPAT from Arabidop- 
sis is 91 E (Fig. 6). If this is the amino terminus of 
the mature GPAT, the molecular mass of the ma- 
ture polypeptide of the GPAT from Arabidopsis is 
calculated to be 41 204 Da. However, we ob- 
served that the precursor from Arabidopsis, which 
had been over-expressed in transgenic tobacco 
and targeted into the chloroplasts, was estimated 
to be of 43 kDa by SDS-PAGE (Ishizaki, Nishida 
and Murata, unpublished result). Hence, it is 
likely that the processing site of the precursor to 
GPAT from Arabidopsis is located ca. 20 amino- 
acid residues upstream of the 91 E residue. It is 
also likely that the purified GPATs from pea and 



squash have been truncated off the amino-termini 
during purification. 

In conclusion, we have cloned the structural 
gene from A. thaliana for a precursor to plastid- 
located GPAT. Cross-hybridization provides a 
useful approach for the isolation of this gene and 
the cDNA for plastid-located GPAT, at least 
from dicots. It will be of interest to determine 
whether the cloned DNA and cDNA can also 
serve as heterologous probes for detection of other 
genes for acyltransferases from higher plants and 
cyanobacteria. Cloning of the ATS1 gene with 
flanking DNA sequences will allow the study of 
cis and trans regulatory elements of the gene. 
These studies and complementary studies of the 
gene for acyl-ACP hydrolase will allow evaluation 
of their regulatory role in the biosynthesis of glyc- 
erolipids in plant cells. As pointed out in our 
previous study [9], plastid-located GPAT from 
A. thaliana exhibits no homology to GPAT from 
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Escherichia coli [16]. Compilation of basic data, 
such as the sequences of the genes, RNAs and 
proteins for plant GPATs is a prerequisite for an 
understanding of the molecular diversity and evo- 
lution of GPATs in plants and animals. 
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ABSTRACT 

A cerulenin insensitive 3-ketoacyl-acyl carrier protein synthase 
has been assayed in extracts of spinach (Spinacia oleracea) leaf. 
The enzyme was active in the 40 to 80% ammonium sulfate 
precipitate of whole leaf homogenates and catalyzed the synthe- 
sis of acetoacetyl-acyl carrier protein. This condensation reaction 
was five-fold faster than acety!-CoA:acyl carrier protein transac- 
ylase, and the initial rates of acyl-acyl carrier protein synthesis 
were independent of the presence of cerulenin. In the presence 
of fatty acid synthase cofactors and 100 micromolar cerulenin, 
the principal fatty acid product of de novo synthesis was butyric 
and hexanoic acids. Using conformationally sensitive native poly- 
acrylamide gel electrophoresis for separation, malonyl-, acetyl-, 
butyryl-, hexanoyl, and long chain acyl-acyl carrier proteins could 
be detected by immunoblotting and autoradiography. In the pres- 
ence of 100 micromolar cerulenin, the accumulation of butyryl- 
and hexanoyl-acyl carrier protein was observed, with no detect- 
able long chain acyl-acyl carrier proteins or fatty acids being 
produced. In the absence of cerulenin, the long chain acyl-acyl 
carrier proteins also accumulated. 



It has been known for decades that fatty acid biosynthesis 
involves extending an acyl-ACP 1 stepwise by two carbons. In 
plants and bacteria, this synthesis is catalyzed by a type II 
fatty acid synthase, a readily dissociable group of enzymes 
which can be individually isolated and studied (1). One of the 
initial reactions leading to chain elongation is catalyzed by 
the enzyme 3-ketoacyl-ACP synthase, frequently referred to 
simply as the 'condensing enzyme.' The spinach fatty acid 
synthase was initially separated into its individual components 
by Shimakata and Stumpf (13, 14). In that study, they found 
two 3-ketoacyl-ACP synthases, designated I and II. 3-Keto- 
acyl-ACP synthase I was responsible for the majority of the 
condensations, using acyl-ACPs ranging from 2:0-ACP to 
I4:0-ACP. 3-Ketoacyl-ACP synthase II had its primary func- 
tion in stearic acid synthesis, and thus was most active with 
14:0-ACP and 16:0-ACP. A diagnostic difference between the 
two synthases was their sensitivity to the antibiotic cerulenin. 
3-Ketoacyl-ACP synthase I was very sensitive and was inhib- 
ited 100% by as little as 10 cerulenin. In contrast, 3- 
ketoacyl-ACP synthase II was inhibited only 50% by 50 mm 
cerulenin. Thus, the characteristic sensitivity of fatty acid 

'Abbreviation: ACP. acyl carrier protein. 



biosynthesis to cerulenin can be attributed entirely to the 
inhibition of 3-ketoacyl-ACP synthase I. 

Similarly, the 3-ketoacyl-ACP synthases found in Esche- 
richia coii, are characteristically cerulenin sensitive (5). Re- 
cently, a third condensing enzyme has been reported in E. 
coii. which has been named acetoacetyl-ACP synthase (6). 
This 3-ketoacyl-ACP synthase is distinctly different from the 
other 3-ketoacyl-ACP synthases in several important respects. 
It is cerulenin insensitive, is specific for very short chain acyl- 
ACPs, and it prefers acetyl-CoA over acetyl-ACP (7). As a 
consequence of this last property, it would appear that this 
enzyme can bypass the much slower acetyl transacylase step 
and as a result, the acetyl transacylase would not be rate 
limiting in fatty acid biosynthesis. 

In light of these findings in the E. coli system, we have 
reexamined the spinach system and report here the presence 
of a third 3-ketoacyl-ACP synthase which is completely insen- 
sitive to cerulenin, active only with short chain acyl thioesters, 
and appears to prefer acetyl-CoA to acetyl-ACP. 

MATERIALS AND METHODS 

Materials 

Spinach {Spinacia oleracea) was purchased locally. Spinach 
ACP was the generous gift of J. Ohlrogge, Michigan State 
University. Malonyl-CoA was prepared by the method of 
Rutkoski and Jaworski (12). [l- 14 C]AcetylCoA was prepared 
from [ I4 C]sodium acetate by preparing an acetyl-imidazole 
intermediate (3). Acyl-ACPs were prepared by the method of 
Cronan and Klages (4). All other chemicals were reagent 
grade or best available. 

Enzyme Preparation 

A crude spinach leaf extract was prepared by the method 
of Shimakata and Stumpf (14). The 40 to 80% ammonium 
sulfate precipitate was dissolved and dialyzed against 50 mM 
potassium phosphate (pH 8.0), 20% (v/v) glycerol, and 2 mM 
dithiothreitol. The protein concentration of this crude spinach 
leaf extract was 19 mg/mL as determined by the method of 
Bradford (2). 

Enzyme Assays 

The in vitro synthesis of acyl-ACPs was carried out under 
assay conditions for acetyl-CoA:ACP transacylase and fatty 
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acid synthase. For acetyl-CoA:ACP transacylase, the reaction 
mixture contained 100 mM Tris (pH 8.0), 0.5 itim 2-mercap- 
toethanol, 60 of spinach ACP, 10 fiM [l- I4 C]acetyl-CoA 
(56 mCi/mmol), and 15 mL of crude spinach leaf extract (0.29 
mg protein) in a total volume of 150 mL. Malonyl-CoA:ACP 
transacylase was assayed the same as the acetyl transacylase, 
but 100 mm [2- ,4 C]maionyl-CoA (10 mCi/mmol) replaced the 
acetyl-CoA. For fatty acid synthase assays, the reaction mix- 
ture contained in addition to the components of the acetyl 
transacylase reaction, 100 mm malonyl-CoA, 1 mM NADH, 2 
mM NADPH. and 100 hm cerulenin when indicated. Ceru- 
lenin was maintained as a 1.0 mM stock solution (pH 4.0) at 
4°C. Reactions were stopped by transferring an aliquot of the 
reaction mix at the designated interval to an equal volume of 
10% (w/v) TCA, mixing, and storing on ice for 5 to 15 min. 
All samples were diluted with 1 mL of 5% TCA, centrifuged, 
and the supernatant discarded. Pellets were washed with an 
additional 1 mL of 5% TCA and then redissolved in 25 fiL of 
50 mM Mes, approximate pH of 6.3. An aliquot of this 
solution was analyzed by liquid scintillation spectrometry or 
used immediately for subsequent analysis. 

Electrophoretic Separation and Analysis 

Native PAGE (11), with 13% acrylamide and 1 mm thick- 
ness were run at 15 mA and at 15°C Samples in 50 mM Mes 
were diluted with an equal volume of 20% (v/v) glycerol, 
0.002% (w/v) bromophenol blue, and loaded onto gels di- 
rectly. Proteins were transferred to nitrocellulose (8). For 
autoradiography, nitrocellulose was dried, and exposed to film 
for 48 h. 

Immunoblots were carried out after proteins were trans- 
ferred to nitrocellulose. Nitrocellulose was blocked with 5% 
(w/v) nonfat dry milk (Carnation Co., Los Angeles, CA) in 
TBST (10 mM Tris [pH 8.0], 150 mM NaCl, 0.05% [v/v] 
Tween 20) for 1 h. Spinach ACP I antibody was added to the 
blocking solution and incubated an additional 1 h. The anti- 
body solution was poured off and the nitrocellulose was 
washed three times with TBST for 10 min each. Immuno- 
staining was carried out using the anti-rabbit IgG alkaline 
phosphatase conjugate. 

HPLC 

Acyl groups esterified to the ACP in the- electrophoresis 
samples were analyzed on HPLC following mild alkaline 
hydrolysis. These samples were hydrolyzed by adjusting to 
approximately pH 9.5 and treating with 100 mM 2-mercap- 
toethanol for 1 h at 37°C (9). Samples were directly injected 
into an HPLC system equipped with ODS column, eluted by 
the solvent system indicated in Table I and the radioactive, 
fractions collected. Retention time of each fatty acid was 
compared to the available standard. 

RESULTS AND DISCUSSION 

Initial Velocity Measurements 

In E. coli, the acetoacetyl-ACP synthase was distinguished 
from the 3-ketoacyl-ACP synthases by its insensitivity to 



cerulenin (6). In addition, it preferred acetyl-CoA as a sub- 
strate and could be distinguished from acetyl transacylase by 
a 5- to 1 0-fold higher activity (7). Since Shimakata and Stumpf 
(15) had demonstrated that the acetyl transacylase was the 
slowest reaction step in fatty acid biosynthesis, we reasoned 
that if the acetyl transacylase step was bypassed by a more 
active 3-ketoacyl-ACP synthase, the overall rate of fatty acid 
biosynthesis should exceed the rate limiting acetyl transacylase 
step. The reactions used [ l4 C]acetyl-CoA as the only labeled 
substrate, in the absence of ATP, to ensure that any fatty 
acids synthesized would be labeled only once and thus have 
the same specific activity as the product of the acetyl trans- 
acylase reaction. Furthermore, the commonly employed 
methods for assaying fatty acid synthase were not used, in 
that these methods are dependent on measuring longer chain 
(10-18 carbons) fatty acids only. Since we were interested in 
measuring true initial velocities, we expected that a significant 
amount of the fatty acid synthase products would still be 
shorter chain fatty acids, that they would still be esterified to 
ACP, and they could be measured by acid precipitation. Acid 
precipitation was also the method used for the measurement 
of the acetyl transacylase, and thus would allow a direct 
comparison of the rates of the two reactions. The assay 
conditions for acetyl transacylase and fatty acid synthase 
differed only by supplementing the transacylase assay mixture 
with NADH, NADPH, and malonyl-CoA to allow for the 3- 
ketoacyl-ACP synthase and reductase reactions to occur. As 
illustrated in Figure 1, the initial velocity of the fatty acid 
synthase was 5 -fold higher than the acetyl transacylase (fatty 
acid synthase: 1 1.7 pmol/min/mg protein versus acetyl trans- 
acylase 2.2 pmol/min/mg protein). These initial velocities 
were calculated from the linear portion of the progress curves, 
i.e 1 to 4 min. The initial velocity data indicate that since the 
fatty acid synthase was much faster than the acetyl transac- 
ylase, the transacylase reaction was not rate-limiting and was 
bypassed by fatty acid synthase. Furthermore, it would indi- 
cate that acetyl-CoA rather than acetyl-ACP was the primer 
in these reactions. 

To further examine if acetyl-CoA was preferred over acetyl- 
ACP as the primer for fatty acid synthesis, incubations con- 
taining 10 ftM [ 14 C]acetyl-CoA were supplemented with 10 
and 20 fiM unlabeled acetyl-ACP. If there was equal or greater 
preference for acetyl-ACP over acetyl-CoA, the incorporation 
of label should have been decreased by 50% or more with 
equi molar acetyl-ACP. However, the incorporation of acetyl- 
CoA into fatty acids was diminished by only 25%, even with 
a twofold excess of acetyl-ACP. This decrease may even be 
attributed to dilution of label in the acetyl-CoA by the acetyl- 
ACP as a result of the reversible acetyl transacylase activity. 
Thus, this data was consistent with a strong preference for 
acetyl-CoA as the primer for fatty acid synthesis. 

The effect of cerulenin on the rate of fatty acid synthase 
under these assay conditions was also determined (Fig. 1). 
The initial rate of fatty acid synthase was essentially un- 
changed by the addition of 100 fxM cerulenin. This concentra- 
tion was 10-fold higher than the concentration needed to 
completely inhibit 3-ketoacyl-ACP synthase 1(14), and indi- 
cated that a cerulenin insensitive 3-ketoacyl-ACP synthase 
was functioning in these reactions. Enzyme extracts were 
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Table I. HPLC Data that Confirm the Identity of the Acyi Groups Attached to the ACPs 



Acyt Group Retention Time Elution Solvent 





min 




Acetate 


5.3 


5 mM triethylamine (pH 5.0) adjusted with H 3 PO* 


Malonate 


6.1 


5 mM triethylamine (pH 5.0) adjusted with H3PO4 


Acetoacetate 


6.5 


5 mM triethylamine (pH 5.0) adjusted with H 3 P0 4 


Butyrate 


6.3 


50% MeOH in 10 mM H 3 PO* 


Hexanoate 


17.1 


50% MeOH in 10 mM H 3 P04 




2 5 15 



E 
15 



Time (min) 

Figure 1. in vitro synthesis of acyl-ACP was measured under con- 
ditions for acetyl-CoA: ACP transacylase (curve A), fatty acid synthase 
activity (curve B), and fatty acid synthase activity with 100 hm 
cerulenin (curve C) as described in "Materials and Methods." Reac- 
tions were stopped by transferring 25 of the reaction mix at the 
designated interval to an equal volume of 10% (w/v) trichloroacetic 
acid (TCA), mixing, and storing on ice until the end of the experiment. 
Samples were prepared for liquid scintillation spectrometry as de- 
scribed in "Materials and Methods." 

treated with cerulenin in several ways to avoid any artifacts 
due to stability or solubility of the antibiotic. Typically, a 1 
mM stock at pH 4 stored 4°C was used (10). Freshly prepared 
solutions were also used as well as a freshly prepared ethanol 
solution (6), with no inhibition observed. Preincubation of 
the enzyme with cerulenin for as long as 30 min had no effect, 
as was the case with using lower cerulenin concentrations (10 
and 50 /iM). 

Acrylamide Gel Analysis of Products 

As the kinetic data were based on measuring acid precipi- 
table radiolabeled products, it was necessary to confirm that 
these products were acyl-ACPs as well as to determine the 
identity of the acyl groups in the fatty acid synthase products. 
The native acrylamide gel system employed by Rock and 
Cronan (11) was used to successfully separate the acyl-ACPs 



acetoacetyl ACP — 
malonyl ACP-** 
acetyl ACP— 

4:0 ACP"^ 



6:0 ACP 



LCaeyl + 



Figure 2. Autoradiogram of a native 13% PAGE of [ 14 C]acyl-ACPs 
produced in vitro from spinach under the indicated reaction condi- 
tions. Enzyme assays were carried out and acyl-ACPs were prepared 
as described in "Materials and Methods." Each reaction was run for 
2, 5, or 15 min, as indicated at the top of each lane. A, Acetyl- 
CoA:ACP transacylase— [ 14 C] precursor was [1- 14 C]acetyl-CoA; B, 
fatty acid synthase with 100 cerulenin, but omitting both NADPH 
and NADH; C, fatty acid synthase with 100 mm cerulenin; D, fatty acid 
synthase without cerulenin present; E, malonyl-CoA:ACP transacyl- 
ase (same as acetyl-CoA:ACP transacylase, but with [2- 14 C]malonyl- 
CoA as the substrate. Native gel was run at 15 mA and 15°C. 
Proteins were transferred to nitrocellulose, dried, and exposed to film 
for 48 h. 

(Fig. 2). The acid precipitable product of the acetyl transac- 
ylase assay was exclusively [ 14 C] acetyl- ACP, present in the 
expected low quantity. When the assay mixture was supple- 
mented with only malonyl-CoA, a single product, acetoacetyl- 
ACP, was expected. There was only one radioactive band, 
tentatively identified as acetoacetyl-ACP, detected in these 
incubation products, migrating slightly behind the malonyl- 
ACP (compare lanes B and E, Fig. 2). These bands always 
lacked sharpness and we attribute this to instability of aceto- 
acetate at pH 9.0. Supplementing this assay with NADH and 
NADPH resulted in production of saturated acyl-ACPs. When 
cerulenin was present, the major products were butyryl- and 
hexanoyl-ACP, with the precursor-product relationship ap- 
parent between them. In the absence of cerulenin, longer 
chain acyl-ACPs were also produced, as expected. Thus, these 
results were consistent with the presence of a cerulenin insen- 
sitive short chain 3-ketoacyl-ACP synthase, analogous to the 
acetoacetyl-ACP synthase found in E. coli. 

The identities of the radiolabeled proteins as acyl-ACPs 
were determined by immunoblotting with spinach ACP an- 
tibody (Fig. 3) and HPLC analysis of the acyl groups (Table 
I). The products of 15 min reactions (Fig. 3, lanes 4 and 5) 
were compared with standards and the bands observed on the 
autoradiogram. The radiolabeled bands corresponding to ma- 
lonyl-. butyryl-, hexanoyl-, and long chain acyl-ACPs were 
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1 2 3 4 5 6 7 



malonyl-ACP-^ 

ACP-^ 
acetyl-ACP 

4:0- ACP-* 



6:0 -ACP-^ 



8:0- ACP-^ 
10-16:0-ACP"^ 




Figure 3. Immunoblot of a native 13% PAGE of ACPs and acyl- 
ACPs. Lanes 1 , 2 t 7, spinach ACP reduced with 5, 20, and 20 mM 2- 
mercaptoethanol, respectively. Lanes 3, 6, mixture of acetyl-, buty- 
ryl-, hexanoyl-, octanoyl-, decanoyl-, dodecanoyl-, and palmitoyl-ACP 
standards. Lane 4, products of 15 min fatty acid synthase reaction 
in the presence of 100 cerulenin. Lane 5, products of 15 min fatty 
acid synthase reaction without cerulenin. Enzyme assays were car- 
ried out and acyl-ACPs were prepared as described in "Materials and 
Methods." Native gel was run at 15 mA and 15°C. Proteins were 
transferred to nitrocellulose, dried, and immunoblotted with an anti- 
body to spinach ACP I. 

immunoreactive with the ACP antibody. Several unlabeled 
immunoreactive bands were detected, including two unchar- 
acterized forms of ACP. These two bands, which migrated 
just ahead of reduced ACP, disappeared upon reduction of 
the samples with 2-mercaptoethanoI, and thus appeared to be 
oxidized forms of ACP. Because these bands were present in 
our incubation products, they obscured any acetyl-ACP band 
that might have been present. Acetyl-ACP (upper band in 
lanes 3 and 6) migrated at almost the same rate as one of 
these ACP bands. Since 2-mercaptoethanol catalyzed the 
rapid hydrolysis of the acyl-ACPs (data not shown), we could 
not add 2-mercaptoethanol to our samples (lanes 4 and 5) to 
examine for the presence of acetyl-ACP. Note in Figure 2, 
lanes C and D, that there was a minor radiolabeled protein, 
migrating between malonyl- and acetyl-ACP, which was con- 
sistently observed when NADH and NADPH was present 
during the incubation. This band is about where we expected 
to see reduced ACP (Fig. 3, lanes 1, 2, and 7), but there was 
no corresponding band visible in our samples on the immu- 
noblots (Fig. 3, lanes 4 and 5). Thus, the labeled band observed 
on the autoradiogram presumably was not an acyl-ACP and 
was not further characterized. Finally, acetoacetyl-ACP could 
not be clearly identified on the immunoblots. The incubations 
which produced f ,4 C]acetoacetyl-ACP also contained high 
levels of unlabeled malonyl-ACP, which obscured the diffuse 
acetoacetyl-ACP band. 

In addition to comparison with standard acyl-ACPs on 
PAGE, the identification of the acyl group esterified to ACP 
was also determined by HPLC. Each of the acyl-ACP samples 
was hydrolyzed, and the free acids analyzed directly by HPLC. 
Because of the wide range of hydrophobicity of the acyl 
groups, several different isocratic solvent systems (see Table 



I) were used to analyze free acids. Samples containing only 
one or two radioactive acyl-ACPs were used in this analysis 
to eliminate ambiguity in the identification of the acyl-ACP. 
For each acyl-ACP, the retention of the corresponding free 
acid was compared with an authentic standard. 

In summary, we have reported here that a third 3-ketoacyl- 
ACP synthase was identified in spinach. It had the distinctive 
characteristics of being cerulenin insensitive, specific for the 
synthesis of short chain fatty acids, i.e. C4 and C6, and 
apparently could use acetyl-CoA as a substrate. This enzyme 
appeared to be completely analogous to the E. coli acetoacetyl- 
ACP synthase reported by Jackowski and Rock (6) with an 
important distinction that in vitro products included C6 fatty 
acids. Thus, it would more appropriately be referred to as a 
short chain 3-ketoacyl-ACP synthase rather than a acetoace- 
tyl-ACP synthase. 
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Abstract 

Two linked genes, Al and A2, coding for nearly identical isoforms of the acyl carrier protein (ACP) were 
isolated from an Arabidopsis thaliana (Columbia) genomic library and sequenced. The amino acids deduced 
from the nucleotide sequence of the two genes indicate they encode distinct transit peptides, but the 
mature proteins are the same except for residue 79. Both genes are predicted to contain three introns in 
similar positions, although they differ in sequence and length. The introns interrupt regions coding for 
a) the transit peptide, b) the junction of the transit peptide and mature protein, and c) the highly conserved 
domain surrounding serine 38 to which the phosphopantetheine is attached. Primer extension analysis 
indicates that at least Al is active in young plants. 



The acyl carrier protein (ACP) is a key com- 
ponent in de novo fatty acid biosynthesis. It is a 
small acidic protein with a 4' -phosphopante- 
theine prosthetic group, attached through a serine 
located 38 residues from the N-terminus, to which 
growing fatty acid chains are covalently bound. In 
higher plants the major site of fatty acid biosyn- 
thesis appears to be the chloroplast (see for review 
Ohlrogge, 1987). Thus, ACP is synthesized as a 
precursor, imported into the organelle, and pro- 
cessed to its mature form. Recently, it was 
demonstrated that the chloroplast contains a 
holo-ACP synthase activity capable of trans- 
ferring the prosthetic group from CoA, suggesting 
that production of the functional moiety occurs 
after translation into the organelle (Fernandez 
and Lamppa, 1990). At least two isoforms of 
ACP have been detected by SDS-PAGE and 



peptide sequence analysis (Hoj and Svendsen, 
1984; Ohlrogge and Kuo, 1985), but characteriza- 
tion of cDNA from Brassica (Safford et al, 1988; 
Rose etal, 1987), barley (Hansen and von 
Wettstein-Knowles, 1989) and spinach (Scherer 
and Knauf, 1987) indicates that other variants 
also exist in the cell. 

An Arabidopsis thaliana (Columbia) cosmid ge- 
nomic library (Olszewki et al, 1988) was screened 
with a Brassica campestris seed-specific cDNA 
clone (Rose etal, 1987) coding for the ACP 
precursor. Cosmid DNA from a positive clone 
was digested with the enzymes shown in Fig. 1, 
and was predicted to contain ca. 18 kb of genomic 
DNA. Southern blot analysis using the Brassica 
cDNA insert (570 nt) as a probe showed that 
multiple bands hybridized in almost all of the 
restriction enzyme profiles. For example, in the 
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B. 21-mer oligo probe 
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//wd III profile (Fig. 1 A, lane 1) fragments of 3.9, 
2.2 (open arrow) and 1.6 kb (closed arrow) 
hybridized with the cDNA insert that contains the 
entire coding sequence of the ACP precursor. The 
simplest pattern observed was after the Pst l/Sac I 
double digestion (Fig. 1 A, lane 1 1), which showed 
two bands hybridizing with sizes of 3.7 (closed 
arrow) and 4.3 kb (open arrow). The complex 
hybridization profiles suggested the presence of 
several introns in a single ACP gene, or multiple 
ACP genes with internal restriction enzyme sites 
on the 18 kb genomic insert. To distinguish 
between these two possibilities, a degenerate 
21-mer oligonucleotide was synthesized (see 
Fig. 1, legend), complementary to the sequence 
coding for residues DTVEIVM, which begin 
immediately carboxy to serine 38, and are highly 
conserved. In most profiles (Fig. IB) the oligonu- 
cleotide hybridized to only two bands which were 
a subset of those that hybridized with the cDNA, 
strongly supporting the conclusion that the 18 kb 
insert contained two linked genes coding for 
ACP. The relative intensity of the hybridizing 
bands was about equal, in contrast to the pattern 
observed when the Brassica cDNA was used as 
a probe, suggesting that the Brassica sequence 
may be more distantly related to one of the two 
genes contained on these fragments. To substan- 
tiate this assessment, a partial restriction enzyme 
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Fig. 1. Detection of two linked genes, A 1 and A2, coding for 
ACP. 

DNA was isolated from a cosmid clone that hybridized with 
a Brassica cDNA coding for ACP. The DNA was digested 
with the following restriction enzymes, as indicated, in 
lanes l)//mdIII,2)i//>idIII/£coRI, 3) Hind I U/Pst 1, 4) ft/I, 
5) Pstl/EcoRl 6) EcoKL, 7) EcoRl/Sacl, 8) EcoRl/Pstl, 9) 
Kpnl/Sacl/EcoRl 10) Kpnl/Sacl, 11) PstljSach 12) Sad, 
13) Sacl/HindUl, 14) Kpnl/Hindll!, 15) Kpnl, 16) 
Kpnl/EcoRh \1) Kpnl/Pstl, and 18) Kpnl/ Sacl. Southern 
blot analysis was -performed using either the radiolabeled 
Brassica cDNA insert (panel A), or a 21-mer oligonucleotide 
(panel B). The sequence of the degenerate oligonucleotide is 
CATNAC(A/G/T)AT(T/C)TC(T/C)ACNGT(A/G)TC. The 
cDNA was hybridized in 3 x SSC (1 x SSC = 0.15 M 
NaCl, 0.015 M Na-acetate) at 48 °C, and washed in 
0.5 x SSC, 0.1% SDS at 55 °C. The oligonucleotide was 
hybridized in .3 x SSC at 42 °C, and washed in 2 x SSC, 
0.1% SDS at 33 C C. In the HindUl and Pstl/Sacl profiles, 



the large arrows point to the fragments carrying part of either 
Al (open) or A2 (closed), as assessed by their relative 
hybridization with the Brassica cDNA. The arrowheads in 
panel A (left side) indicate the position of lambda 
Hind lll/Eco RI markers, with sizes (top to bottom) of 21.7, 
5.15/5.0, 4.27, 3.48, 1.98, 1.9, 1.59, 1.37, 0.94, 0.83, 0.56, 
0.14kb. In panel C, a partial restriction enzyme map is 
shown with the orientation of each gene, and its basic struc- 
ture. In addition to the Hind Ill-Sac I fragment, for gene Al, 
990 nt were sequenced upstream of the internal Hind III site, 
and 185 nt 3'of the Sacl site. For gene A2, 760 nt were 
sequenced 5'of the internal Hind III site, as well as 370 nt 
3'of the Sacl site. The exon boxes represent the transit 
peptide (dotted) and mature (black) regions of the ACP 
precursor. The numbers refer to the approximate size (kb) of 
each intron. The position of the TATA box relative to the 
start codon is indicated in nt. The distances between the Pstl 
and most distal HindlU site are ca. 5.6 kb and 5.7 kb for Al 
and A2, respectively. 
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fragment map was deduced from the Southern 
analysis (Fig. 1C), and selected fragments were 
sequenced. 

Two genes coding for ACP, called Al and A2, 
were identified with their respective putative 
transit peptides (Fig. 2). They are 82% similar at 
the nucleotide level, and show 59% and 63% 
sequence relatedness (restricted to the mature 
protein coding sequence), respectively, to the 
Brassica cDNA initially used to identify them. 
The open reading frame for the mature protein 
encoded by Al or A2 was based on the known 
amino acids of ACP from spinach (Kuo and 
Ohlrogge, 1984). Except for residue 79, which is 
phenylalanine (Al) or leucine (A2), the two 
mature proteins are identical at the amino acid 
level. One other Arabidopsis thaliana (Columbia) 
ACP (Post-Beittenmiller etal, 1989), referred to 
as the product of gene A3, differs significantly 
(23/83 residues), although a net charge of - 9 is 
maintained. 

The amino acid sequence of each transit pep- 
tide was predicted according to four criteria: a) 
the maximum similarity between the two se- 
quences encoded by Al and A2; b) the maximum 
similarity with the putative transit peptide de- 
duced from A3 of Arabidopsis (Post-Beittenmiller 
etal, 1989), and two Brassica cDNA clones 
(Rose etal., 1987; Saffordef al, 1988); c) the fact 
that other reading frames are repeatedly inter- 
rupted by stop codons; and d) adherence to the 
GT/AG rule, and if possible the plant consensus 
sequences for exon/intron donor and acceptor 
sites (Brown, 1986). Starting with the initiator 
ATG, an open reading frame continues for 38 
codons; another extends 5' from the beginning of 
the mature protein for 35 codons (Fig. 2). We 
predict an intron occurs between these two open 
reading frames. Although the start of the intron is 
difficult to predict, at the 3 ' end in both A 1 and A2 
a GCAG : G complies with the acceptor consen- 
sus sequence, and thus probably marks the start 
of the carboxy terminus of the transit peptide. As 
a consequence, the donor site must be in frame, 
and indicates that in A2 the sequence coding for 
the N-terminus of the transit peptide ends at the 
codon for leu 19, which is followed by the only GT 



that does not interrupt a codon. There are three 
potential GT donor sites (see arrows, Fig. 2) in 
Al; the first coincides with the site found in A2, 
following leu 19. Hence, if the coding sequence is 
limited to the residues underlined in Figure 2, the 
transit peptides encoded by Al and A2 are 80% 
related. Typically, they are rich in hydroxylated 
amino acids (serine and threonine) and the basic 
amino acid arginine. 

The sequence analysis indicates that both Al 
and A2 contain three introns in similar positions, 
although each intron differs in length between the 
two genes. Intron one interrupts the sequence 
coding for the transit peptide* as discussed above. 
Intron two (115 nt, Al ; 100 nt, A2) occurs at the 
junction between the transit peptide and the 
mature protein, and may separate the codon for 
the first amino acid, alanine, from the remainder 
of the mature protein. Intron three (324 nt, Al; 
264 nt, A2) begins 3 codons downstream of the 
special serine 38, and contains an internal block 
of ca. 160 nt (starting at nucleotide 1329 in A 1, 
and 1 139, A2) that is ca. 90% similar between the 
two genes. Introns two and three occur in the 
same positions in Al, A2 and A3, as well as in two 
Brassica genes (deSilva etal., 1990). 

A TATA box (TATAAA) is found 100 nt 5 'of 
the ATG in gene Al and 105 nt upstream of A2. 
The ATG is situated within a sequence motif 
(TCTATGGC) that follows the plant consensus 
sequence (ACAATGGC) for translation ini- 
tiation (Joshi, 1987a). At the 3 'end, neither gene 
contains an AATAAA, the consensus sequence 
for polyadenylation found in transcripts from 
animal cells, within 150 nt of their stop codons. 
However, gene A2 has an AATAAT and an 
AATGAA present 232 nt and 259 nt, respec- 
tively, 3 'of its translation stop, similar to that 
observed in other ACP encoding genes (Safford 
etal, 1988) and plant genes in general (Joshi, 
1987b). 

To determine if Al, and possibly A2, is actively 
transcribed, an oligonucleotide was synthesized 
to use as a primer in reverse transcription 
reactions. The oligonucleotide (5 ' -GCGAGGAC- 
GAGCCTTG4AGGG-3 ' ) is the reverse comple- 
ment of the codons for amino acids 11-17 in the 



1 CATAGTAAATTGATGCGTACAAAATGAATGTAAATTGATTACAGACATAA 

5 1 CAGAAATGACAACGTAAACTTTATAACACAATAAAATAGTAATCAAAATT 

101 ATACAA77GCAACGAA7AAAT7T7CTGAAA77G7777CGTA7AAAAAAAA 

151 TTTAAACAGTAAAACATAATTAAATGTAAAAAATCTTTG7GATTTTTTTT 

201 T77G7AAAAAGACGTAGTAGAATTTACAAA77AAAAGGCATAAAA7G7AT 

2 S 1 AACAGATAAAACCATATAAAGTTTCTTAATATTTTCTTAATATTTTGCCT 

301 TTTGTTTCTTGTCATAGGTTTTATTTTCCACCGTCACTGCCAAGCACCAC 

351 TCCACCT AGCTTCTCTGATTCTGTGTCTTGCCGTTCAACTTTC ATTGGAA 

401 TCTCAGATTTTTTATTTTTTTGTTTGTTTTGOTTTGTTGCGAGTTTGTGA 

451 CTATAAAA CCTCTCCCACTTGGTTCTTCACTCTCACTGTTTCTCATCTCT 

501 TCGTCTTCTCCGCCTCCTTCAATCTCACTCCGATCTCTCTACGATTCATT 

MetAlaSerlleAlaAlaSerAlaSerlleSerLeuGlnAlaAr 
551 CGTTC TATGGCTTCCATTGCTGCTTCTGCTTCTATTTCCCTTCAAGCTCG 

gProArgGlnLeuValSerLeuPheProArglleSerHisLeuValLeuL 
601 TCCTCGCCAACTGGTCTCTCTCTTCCCTCGAATCTCTCATCTTGTTTTAC 

X A 

euValLeuLeu 

651 TTGTTTTGCTGT AG ATCTGATCTGATTGATTGTTGTTCTTCTTCTTCAAT 
4 

701 TGTAGATCTAATTTGCAGAATCGTTTTCTGAGTTCTAGGTTGATTTGTTT 

751 AGGTGGCTCTATGATTT AGTTTTCTGGAACTTTGGCTCTGT AG ATCTGAT 

801 GAATCTGTTTTACTATTCTCCGTTGCTAATCACCGGAAT AATTGTTGATT 

851 TCTCGTCGAATTGAATCACAT AGATTTTAGTTGCTTCTGTTT AACTTCTC 

901 TCGTAGATTGTTTCGTTATTTTGGTTTGATGCTTTGCATTACGAGTTTAA 

TrpGlnAlalleAlaAlaSerGlnValLysSerPheSer 
951 C777G7ACTGATGGCA GGCGA7CGCGGC7AG7CAAG77AAAAGC777AGC 

AsnGlyArgArgSerSerLeuSerPheAanLeuArgGlnLeuProThrAr 
1001 AA7GGAAGAAGAAGCAGTCT77C7T77AA7C7CCGCCAGCT7CC7ACCCG 

gLeuThrValSerCyaAla 

CTTGACTGTTTCCTGCGCTGTAAGACTTTTGTTT77TGCTTTCTCGAGCC 



A 2 ! TTTTTTC CGAA777G77A77TC7C7ACCTGCA7C7GCCAAA7CACCCACC 

5 1 7CACC7GAC7TG7C7G7C7GAA77C7C7CA77GCAA7C7CAGAAAAG777 

101 777777T7TTT77G777AGGTTAAAA7A C7A7AAAA 77AAAAA7AAGTC7 

1 S 1 CCCAC77GGATTCT7CAC7G7GTCTCACA7C7C777G7C77C7CCGCC7C 

201 



251 



301 



HecAlaSerZl 

C7CCGATC7CACTCCGATCTCTCTACGATTCAT7CTTC TATGGCT7CCAT 

eAlaThrSerAlaSerThrSerLeuGlnAlaArgProArgGlnLeuValS 
TGCTACT7C7GCTTCTACT7C7C7CCAAGC7CGTCCTCG TCAAC7GG TCT 
- -j 

erLeuPheSerSerSerProArglleSerHisLeuPheLeuThrGluPhe 
C7C7T77C7CTTC77CCCC7CGAA7C7C7CA7CTTT77C7TAC7GAA7TT 



7G77CAACAATGTGA77AGGC7C777GA7ACG777GGGTGA77GG7TGAT 



10S1 
1101 

Ala^ysProGluThrV 

1151 A7A7A7C7CCA77TT777A77G7TG7G7AACAAGGCAAAACC7GAGACAG 

alAapLysValCyaAlaValValArgLysGlnLeuSerLeuLyaGluAla 
1201 7GGACAAGGTG7G7GCAG77G7CAGAAAGCAAC7C7CAC77AAAGAGGC7 

A3pGluIle7hrAlaAla7hrLy3PheAlaAlaLeuGlyAlaAspSerLe 
1251 GACGAAA77AC7GC7GCCACCAAA777GC7GCAC7JGG7GCTGA77CCCT 

uAspThr 

1301 7GA7ACGGTA7AGC77AACAACC77C7CGATG7C7AA7AA77GGA77CAA 

1351 C77TTGAATGGGA7TGTTCA7TGGAC7TAT7GATGC77 AA7CAATGCC7C 

1401 TTAAAAATTGTATCCTGCTTTCAAGCTCATA77G7CGAA7G7T77AGA7T 

1451 GCAAGTCTTGT AAAGGGTCGCTGTGTCTCCT7A7ACGCAGC7CTT7 AG7A 

1501 T7GT7ACTGATACAGGCAAA7ATA7AGGACAT AC7CAAACACATACT7CA 

1551 CAAGCAA7GCATT7A7CAAA77AAATTCAAAATGGATTCGAACGTG7GCA 

ValGluIleValMetGlyL 
1601 CA7AC7CATGAC7CGGCG777TGGA777CAGG7GGAGAT7G7GA7GGGAC 

euGluGluGluPheGlylleGluMetAlaGluGluLysAlaGlnSerlle 
1651 7 AG AGGAAGAGTTCGGGA7TGAAA7GGCGGAGGAGAAAGC ACAGTCAATC 

AlaThrValGluGlnAlaAlaAlaLeuIleGluGluLeuLeuPheGluLy 
1701 GCCACAGT7GAGCAAGCAGC7GCGC7CA77GAGGAGCTC77G777GAAAA 

sAlaLys 

17 51 GGCCAAG7 AGAA7 A7CT7TATTACATT AGCGAAAAAC AAAAAAA7C AAAA . 

1801 ACCCAAAACCAC7A7CTTATTGTT7CGTTAGCTAGAGAGCAA7TGTGTCT 

1851 GTT AAAGAT7TT A7GTTATTT7GGGGAAA77 AT7ACAAGGCT7G7 A7CTA 

1901 C7TT7ATCT77CT7C77AAACCATTT7 1927 

Fig. 2. Nucleotide sequences of ACP genes Al (left) and A2 (right). The predicted open reading frame (see text) of each gene is given in the three-letter amino acid code 
above the nucleotide sequence determined by the dideoxy chain termination method (Sanger et al. t 1977). A putative transit peptide sequence is underlined; the small and 
medium-sized arrows below the line indicate the possible donor and acceptor sites, respectively, of intron one (see text). The large arrow 5' of the initiator ATG indicates 
the transcription start site based on primer extension analysis. The TATA box and potential polyadenylation signals are underlined. The asterisk denotes the start of a 

highly conserved 5' region between Al and A2. 



LeuSer 

351 77G7CGTGA77TC7CCCA7777GC7A7AGA7C7GA7C7GA77CA7C7CGA 

4 01 CC7AG77G77CT7C77C77C77CAAC7GTAGA7C7 AA77CGCAGA77CG7 

4 51 77TC7CA7CTCTA7AT7GAC7G77GAAA77GC7C7G7AA77CATTT777G 

501 7CC7G7AA777TTT7CAT7GAGG7AAGT7AGT777CG77777C7GAATT7 

551 7AGGC7C7G77GAACC7A7T77AC77AAC7CGG7CGAA7CACCGGAAAAA 

601 77A7CC7ATTCC7C77CGAA77GGA777G7777A7A7C7GAA777GAA7A 

6 5 1 GA77CGAAC7C77GG7AAGAGCA7GCGGG77ACA7AGA777T7AGTTGCT 

7 01 777G777CC77C7777GG777G777CAG7GCAAA7A7AT7GC7T7GCATT 

7 51 7GCA7TTGGAGTT7AG7T7TG7AC7GA7GGAAAG77A7A77CGAA7A7GA 

7rpGlnValIleGlyAlaLy3GlnValLy3SerPheSer7yrGlySerAr 
8C1 7GGCA GG7GA77GGGGC7AAACAAG77AAAAGC777AGC7A7GGAAGCAG 

gSerA3nLeuSerPheA3nArgArgGlnLeuPro7hrArgLeuThrVal7 
851 AAGCAA7C777CT777AA7CGGCGCCAGC77CC7ACCCGC7TGAC7G7T7 

yrCyaAla 

901 AC7GCGC7G7AAGACC77G7CTCT7C7C7CA7A7A7GT7GTT7C7A7A7T 

951 77AC77T7GA7C7C7CGAGCCAA77CA7CA77CACCA77G77GA77G7TG 

Ala LyaP roGlu7hrVal AapLy s Va ICy sAlaVa lValArg 
1001 A7AAACAGGCAAAACC7GAGACAG7GGACAAGG7G7G7GCAG77G7CAGA 

Ly3GlnLeuSerLeuLysGluAlaA3pGluIleThrAlaAla7hrLy3Ph 
1 0 5 1 AAGC AAC7C7C AC7C AAAGAGGC 7GACG AAA7 7 ACCGC7 GCCACCAAA7T 

eAlaAlaLeuGlyAlaAspSerLeuA3p7hr 
1101 7GC7GCAC77GGTGC7GA77CCC7TGA7ACGG7AAC7AC777C7CA7G7C 

1151 7AA7AA77GGA77CAAC7777GAAA7GGGA77G77CA77GGAC77A77GA 

1201 7GCA7AA7CAA7GCCTC777AAAC77G7A7CC7GC777CAAGC7CA7A77 

1251 G7CGAA7G777AGA77GCAAG7C77G7GAAGGA7CGCTGC77C7CC77A7 

1301 ACACACA7C7C7AAACA7G77AC7GAGA7ACACA7AC7CA7GA77C77G7 

ValGl 

12 51 7A7A77G77A7GGACG7AC7CA7GC7TCG7G7G7CGGGGT7CCAGGTGGA 

uIleValHetGlyLeuGluGluGluPheGlylieGluMetAlaGluGluL 
14 01 GA7AG77A7GGGA77AGAGGAAGAG7TTGGGAT7GAAA7GGCGGAGGAGA 

y3AlaGlnSerIleAla7hrValGluGlnAlaAlaAlaLeuIleGluGlu 
1451 AAGCACAG7C7A7CGCCACAG77GAGCAAGCAGC7GCAC7CA77GAGGAG 

LeuLeuLeuGluLysAlaLys 
1501 C7C77G77GGAAAAGGCCAAG7AGAAAA7CC777AC7AC77AGCAAAAAA 

1551 CGGAAAAAAACCAAAACCCAAACCAC7A7CTTC7TA77GG7T77GG77AG 

1601 C7 AG AGAGCAA7TCG7CGTCCG77GAGA777T AA7GGC7A7TTTCGGAT A 

1651 A77A7TACAAGGC7TG7A77T7AATA77TAC777CG77CTGAAACATT7T 

1701 ACTTTCAGG7TTAAAA7A77AG7AC777CAGG7AT7CAAACA7AAG7T77 

17 51 AAA7AA7A7AAT7A7GG77GATGAC77CT7AA7GAAAGGATT7GAG77TC 

1801 77GACA7TTT77TGT7TCT77A7AA7CTAGAAG7777G7TGTCCGAAGAT 

1851 7CC AAATTC AGGGG AAGGGGC A7 AAA 187 6 
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transit peptide (SLQARPR) of gene Al, and 
differs in only three positions (underlined) from 
the analogous sequence of A2. The oligonucleo- 
tide sequence is not found in A3 (Post- 
Beitenmiller et al, 1989). RNA was isolated from 
young Arabidopsis plants grown on MS/sucrose 
medium and hybridized (0.2 mg mL" 1 at 21 °C) 
with the radiolabeled oligonucleotide (2 x 10 6 
CPM pmol" *). Following primer extension with 
M-MLV reverse transcriptase, the hybrids were 
digested with RNase as described (Sambrook 
etal, 1989), and immediately analyzed by dena- 
turing gel electrophoresis. A 123 nt RNase-resis- 
tant band was found (not shown). These results 
indicate that transcription begins 72 nt upstream 
of the ATG codon in Al, and possibly A2 as well, 
although a stable hybrid between the primer and 
A2 transcripts may not be favored. The tran- 
scription start site places the TATA box of gene 
Al 28 nt upstream (and potentially 34 nt up- 
stream for A2). The 5'nontranslated regions of 
Al and A2 are 96% related and rich in pyri- 
midines. 

The 5' flanking regions of genes Al and A2 
both contain a highly conserved domain be- 
ginning ca. 95 nt from the transcription start with 
the sequence TCATTGG AATCTCAGA, follow- 
ed by at least 20 thymidines, occasionally punc- 
tuated by a purine. This sequence is not found 
5' of the A3 gene, which codes for a different 
isoform of ACP (Post-Beitenmiller et al, 1989). It 
is interesting to speculate that the sequence is 
important in selectively regulating the expression 
of Al and A2. This question will be most fruitfully 
addressed by the analysis of transgenic plants 
carrying a reporter gene under the control of either 
the Al or A2 promoter. 
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was digested with Eco Rl, fractionated on 0.7% 
agarose, blotted onto Hybond-N (Amersham), 
and hybridized as described [J. Sambrook, A. 
Friscri, T. Maniatis, Molecular Cloning: A Labora- 
tory Manual (Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY, 1989)]. 
16. Protoplasts from N. tabacum SR . 

S.-Breznovits, L. Marton, F. Joo, Claim O / 

(1975)] were isolated [J. Drap FattV 

Armitage, R. Walden, Plant Gen J 

tion and Gene Expression: A La 

(Blackwell, Oxford, 1988)] and c 

Agrobacterium that contained pP 

described [A. Depicker et a/., 

201, 477 (1985)]. Protoplasts were cultured in the 

presence of hygromycin (15 n-g/ml) (Boehringer 

Mannheim) and Claforan (500 jtg/ml) (Hoechst, 



Frankfurt. Germany) with either cytokinin (kinetin, 
0.2 mg/l) (Sigma) and auxin (1-naphthaleneacetic 
acid, NAA, 1 .0 mg/l) (Sigma) or the same media 
lacking auxin. Calli growing in the absence of 
auxin were cultured further to form plants. 
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Map-Based Cloning of a Gene Controlling Omega-3 
Fatty Acid Desaturation in Arabidopsis 

Vincent Arondel, Bertrand Lemieux, Inhwan Hwang, Sue Gibson, 
Howard M. Goodman, Chris R. Somerville* 

A gene from the flowering plant Arabidopsis thaliana that encodes an omega-3 desaturase 
was cloned on the basis of the genetic map position of a mutation affecting membrane and 
storage lipid fatty acid composition. Yeast artificial chromosomes covering the genetic 
locus were identified and used to probe a seed complementary DNA library. A comple- 
mentary DNA clone for the desaturase was identified and introduced into roots of both 
wild-type and mutant plants by Ti plasmid-mediated transformation. Transgenic tissues of 
both mutant and wild-type plants had significantly increased amounts of the fatty acid 
produced by this desaturase. 



The small crucifer Arabidopsis thaliana (L.) 
is suitable for the application of map-based 
cloning methods because it has a small 
nuclear genome that is almost devoid of 
interspersed, highly repetitive DNA (1). 
The five chromosomes have a total DNA 
content of about 70,000 kb (1). The aver- 
age distance from any gene to the nearest 
restriction fragment length polymorphism 
(RFLP) marker (2, 3) is about 225 kb, and 
several yeast artificial chromosome (YAC) 
genomic libraries are available for Arabidop- 
sis (4, 5). Here, we used these resources to 
isolate a gene from Brassica napus that 
complements a mutant of Arabidopsis defi- 
cient in omega-3 unsaturated fatty acids. 

Fatty acid desaturases catalyze the 0 2 and 
electron donor-dependent insertion of dou- 
ble bonds into fatty acids. Because the 
amount of membrane lipid unsaturation af- 
fects the physical properties of membranes, 
the desaturases are thought to affect the 
ability of plants to survive extreme temper- 
atures (6). Also, the nutritional quality of 
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edible plant oils is largely determined by the 
composition of storage triacylglycerols. Ex- 
cept for the stearoyl-ACP desaturase, plant 
desaturases are thought to be membrane 
proteins and have been difficult to charac- 
terize by conventional biochemical methods 
(7). Information about the number and 
properties of the various desaturases in Ara- 
bidopsis has been obtained by the isolation of 
an extensive collection of mutants with 
altered membrane and storage lipid unsatur- 
ation (8). One of the mutations, designated 
/ad3, resulted in reduced accumulation of 
linolenic acid (lS^ 3 - 6 * 9 ), and a corre- 
sponding increase in the amount of linoleic 
acid (181V* 6 ' 9 ), in extrachloroplast mem- 
brane and storage lipids (9). These metabol- 
ic effects suggested that the fad3 locus en- 
coded an co3 linoleate desaturase. 

The fad3 locus was genetically mapped 
by scoring the fatty acid composition of 137 
progeny (F2) of a cross between plants of 
the fad3 mutant line BL1 (Landsberg race) 
(9) and wild type (Niederzenz race). Be- 
cause the fad3 phenotype is only weakly 
evident in chlorophyllous tissues, but is 
strongly expressed in root or seed tissue, the 
fatty acid phenotype of each F2 plant was 
scored by gas chromatographic analysis of 
the fatty acid composition of each of ten 
seeds obtained by self-fertilization of F2 
plants. The RFLP genotype of F2 progeny 
was determined by analysis of DNA prepa- 



rations from F2 plants and F3 families (2, 
3). The fad3 mutation mapped on chromo- 
some 2 adjacent to the RFLP markers M220 
(2) and ASA2 (10) (Fig. 1). When used to 
screen YAC libraries, M220 hybridized with 
YAC EG4E8 (4) and ASA2 hybridized with 
YACs EW7D11 and EW15G1 (5). The 
YACs were ordered by analysis of hybridiza- 
tion of end-specific probes on Southern 
(DNA) blots of the YAC clones. The result- 
ant, approximately 340 kb, contig of four 
YACs extended a minimum of 170 kb to the 
right of ASA2 (Fig. 1). As the maximum 
distance between M220 and ASA2 was 180 
kb (the sum of the inserts in EG4E8 and 
EW15G1), we estimated that 1 centimorgan 
(cM) was equivalent to less than an average 
value of 105 kb in this region. As fad3 was 
0.4 cM to the right of ASA2, we estimated 
that the YAC contig extended far enough to 
the right to include fad3. 

Stearoyl-ACP desaturase, an enzyme 
that catalyzes a chemically equivalent reac- 
tion, is encoded by a moderately abundant 
mRNA in developing seeds of oil-accumu- 
lating plants such as Ricinus communis (11). 
Therefore, we used the YACs to isolate 
moderately abundant cDNA clones. DNA 
from one YAC, EW7D11, was isolated 
from a low-melting agarose-clamped homo- 
geneous electric field (CHEF) gel (12) and 
used to probe a Xgtll cDNA library made 
from developing seeds of the closely related 
crucifer Brassica napus. The B. napus library 
was used because we did not have a suitable 
library from developing seeds of Arabidopsis, 
and genes from B. napus are highly homol- 
ogous to the corresponding Arabidopsis genes 
(13). Of 31 positive plaques among 2 X 10 5 
screened, 17 cross-hybridized at high strin- 
gency and, therefore, appeared to be deriv- 
atives of the same gene. None of the other 
clones were highly represented among the 3 1 
positive clones. The largest insert (1.4 kb) 
representing the abundant transcript was 
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Fig. 1. Genetic map of the region of chromo- 
some 2 (thick line) that contains the fad3 locus. 
The YACs that correspond to this region of the 
genome are shown below. The sizes of the 
inserts (in parentheses), were determined by 
pulsed-fieid gel electrophoresis (12). Arrows 
indicate regions of overlap demonstrated by 
hybridization of end-specific probes. Scales of 
genetic distance (centimorgans, cM) and kilo- 
bases (kb) are indicated. 
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Table 1. Fatty acid composition of transgenic roots. The transgenic roots resulting from infection of 
the fad3 mutant or wild type with A tumefaciens R1000 carrying only the vector (pBI121) or the 
vector plus cDNA (pTiDES3) were grown in the presence of kanamycin (50 jtg/ml) for 3 weeks to 
identify roots that had been co-transformed with one of these plasmids and 200 p.g of cefotaxime 
to prevent growth of bacteria. The fatty acid composition of samples of roots (about 30 mg per 
sample) was determined as described (23). Abbreviations: 16:0, palmitic acid; 16:1, palmitoleic 
acid; 18:0, stearic acid; and 18:1, oleic acid; 18:2, linoleic acid; 18:3, linolenic acid. The values 
presented are the mean ± SD (n = 12). 



Fatty acid 

Genotype — " 

16:0 16:1 18:0 18:1 18:2 18:3 

Percentage of total fatty acids 

Wild type 22.0 ± 2.9 2.5 ± 0.7 2.3 ± 1.9 3.8 ± 1 .3 37.3 ± 3.7 31 .9 ± 4.5 
pBM21 

fad3 21 .2 ± 1 .6 1.6 ± 0.8 2.3 ± 1 .5 5.9 ± 2.6 62.2 ± 5.9 6.7 ± 0.7 
. pBI121 

fad3 21.3 ± 2.3 1.5 ± 0.2 1.6 ± 0.4 9.1 ± 2.0 24.4 ± 14.9 42.1 ± 15.5 
pTiDES3 

Wild type 21.1 ±0.9 2.0 ± 0.1 1.9 ±0.2 7.7 ± 2.0 15.7 ±11.7 51.3 ±10.9 
pTiDES3 



subcloned into pBluescript (Stratagene) to 
produce plasmid pBNDES3. 

In order to test if the cDN A in pBNDES3 
was encoded by the B. napus equivalent of the 
fad3 locus, a genetic complementation test 
was conducted. Because the fad3 mutation 
has a large effect on the fatty acid o>mposition 
of Ardbidopsis roots (9), we exploited the fact 
that large numbers of rooty tumors can be 
rapidly produced by infection with Agrobacte- 
nwrn mnefadens R1000, which carries an Ri 
plasmid from Agmbacterium rhizogenes instead 
of a Ti plasmid (14)* The cDNA insert from 
pBNDES3 was inserted into the binary Ti 
vector pBI121 (Clontech Laboratories, Palo 
Alto, California) under transcriptional con- 
trol of the constitutive cauliflower mosaic 
virus (CaMV) 35S promoter, to produce plas- 
mid pTiDES3. After electroporation of 
pTiDES3 into A. tumefaciens R1000, the bac- 



10 20 30 40 

MWAMDQRSN VNGDSGARKE EGFDPSAQPP FKIGDIRAAI 
50 60 70 80 

FKHCWVKSPL RSHSYVTRDI FAVAAIAMAA VYFDSWFLWP 
90 100 110 120 

LYHVAQGTLF WAIFVLGHDC GHGSFSDIPL LNSWGHILfl 

130 140 150 160 

SFILVPYHGW RISHRTHHQN H6HVENDESW VPLPEKLYKN 

170 180 190 200 

LPHSTHMLRY TVPLPMIAYP IYLWYRSPGK EGSHFHPYSS 

210 220 230 240 

LFAPSERKLZ ATSTTCWSIM IATLVYLSFL VDPVTVLKVY 

250 260 270 280 

GVPYIIFVMW LOAVTYLHHH GHDEKLPWYR GKEWSYUtGG 

290 300 310 320 

LTTIDRDYGI FNNIHHDIGT HVIHHLFPQI PHYHLVDATR 

330 340 350 360 

AAKHVUGRYY REPKTSGAIP IHLVBSLVAS IKKDHYVSDT 

370 380 
GDIVFYETDP DLYVYASDKS KIN 

Fig. 2. Deduced amino acid sequence of the 
protein encoded by the fad3 cDMA. The nucleo- 
tide sequence has been deposited in the Gen- 
Bank database (L01418) and the clone is avail- 
able from the Arabidopsis Biological Resource 
Center (22). Abbreviations for the amino acid 
residues are: A Ala; C, Cys; D, Asp; E, Glu; F, 
Phe; G, Gly; H, His; I, He; K, Lys; L, Leu; M, Met; N, 
Asn; P, Pro; Q, Gin; Ft, Arg; S, Ser; T, Thr; V, Val; 
W, Trp; and Y, Tyr. 



teria were used to induce rooty tumors on 
stem explants from the fod3 mutant and wild- 
type Arabidopsis plants (15). More than 50% 
of the rooty tumors produced in this way 
contain the binary Ti plasmid and are, there- < 
fore, kanamycin-resistant (16) . After 5 weeks, 
the roots were excised from the stem explants, 
cultured for three more weeks, and the fatty 
acid composition of total lipid extracts mea- 
sured (Table 1). The fad3 mutant transformed 
with only the vector had less 18:3 and more 
18:2 fatty acid than did the wild type. Trans- 
formation of the fad3 mutant with pTiDES3 
produced roots that contained greater than 
wild-type amounts of 18:3 and lower amounts 
of 18:2. Transformation of the wild type with 
pTiDES3 resulted in even more 18:3. Thus, 
the B. napus cDNA in pTiDES3 functionally 
complements the fad3 mutation in Arabidop- 
sis, and the proportion of unsaturated lipids 
can be altered by affecting transcription of a 
putative desaturase gene. 

Analysis of the nucleotide sequence of the 
cDNA insert in pBNDES3 revealed a 383- 
amino acid open reading frame that encodes a 
44-kD polypeptide (Fig. 2). The NH 2 - termi- 
nal region does not exhibit the characteristics 
of a signal peptide (17) but the COOH- 
terminus contained the Lysyl residues three 
and five amino acids from the end that have 
been shown to be sufficient in animals for 
retention of membrane proteins in the endo- 
plasmic reticulum (ER) membrane (18). Sev- 
eral strongly hydrophobic internal domains 
could be transmembrane domains. These 
characteristics suggest that the pBNDES3 
cDNA encodes a membrane-bound protein 
located in the ER, which is in agreement with 
the available biochemical evidence concern- 
ing the localization of the 0)3 desaturase en- 
coded by the fad3 locus (9). Comparison of 
the deduced amino acid sequence with the 
protein sequences contained in GenBank re- 
lease 70 using the FASTA program (19) 



indicated homology with the a>6 desaturase 
from the cyanobacterium Synechocystis (20), 
including a 12-residue sequence of which 10 
residues were identical. No homology with 
other proteins could be detected. The se- 
quence homology between the cyanobacterial 
0)6 desaturase and the fad3 gene product raises 
the possibility that the 0)3 and o>6 desaturases 
in higher plants may also have significant 
structural siinilarity. 

Note added in proof: J. Browse, N. Yadav, 
and collaborators have cloned the Arabidop- 
sis fad3 gene by T-DNA tagging (21). 
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Recognition of the 5 ' exon for splicing of the 
ribosomal RNA precursor of T. thermopMa 
involves base pairing of the exon sequence 
CUCUCU with part of an intron internal 
guide sequence (IGS), GGAGGG, to give a 
helix designated PI (1). This process can be 
mimicked with oligonucleotides and trun- 
cated forms of the intron (2, 3). Tertiary 
interactions involving 2' -OH groups of sub- 
strate enhance this binding (4-6). Under 
conditions where all the ribozyme is active 
(6), we report transient kinetic studies in- 
dicating that tertiary contacts form after 
base pairing, and we provide the first rate 
constants for the dynamics of this RNA 
folding step (Fig. 1 A) . 

Conjugation of pyrene to a 5' amino- 
modified ribose (Fig. IB) provides a probe 
of rapid binding steps (7). Binding of pyr- 
CUCU, pyrCCUCU, pyrCUCUCU, and 
pyrCCCUCU to the L-21 Sea I form of the 
ribozyme from T. thermophila (3) increases 
pyr fluorescence by factors of 25, 21, 8, and 
4, respectively (8), consistent with expec- 
tations based on three-dimensional models 
of the binding site (9). All four pyr-labeled 
substrates reacted in single turnover nucle- 
otidyl transfer reactions with 32 P-labeled 
(p*) p*UCGA, suggesting that their fluo- 
rescence enhancement results from binding 
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with L-21 Sea I and pyrCCUCU resulted 
in fast and slow rates for binding, 1/t! 
and 1/t 2 , respectively (Fig. 2). Traces of fluo- 
rescence versus time after mixing were fit 
to a single or double exponential as appro- 
priate. Plots of rates versus [pyrCCUCU] 
give a straight line fit for the faster rate 
and a hyperbolic .fit for the slower rate 
(Fig. 3A). This is consistent with two- 
step binding in which base pairing of 
pyrCCUCU and the IGS to form PI 
occurs in the first step and uptake of PI 
occurs in the second step (Fig. 1A, 
Scheme I): The apparent enhancement of 
pyr fluorescence after tertiary folding is a 
unique observation for a nucleic acid (7). 
For Scheme I and substrate, S, in excess 
over L-21 Seal (II) 



Dynamics of Ribozyme Binding of Substrate 
Revealed by Fluorescence-Detected 
Stopped-Flow Methods 

Philip C. Bevilacqua, Ryszard Kierzek, Kenneth A. Johnson, 

Douglas H. Turner* 

Fluorescence-detected stopped-flow and equilibrium methods have been used to study the 
mechanism for binding of pyrene (pyr)-labeled RNA oligomer substrates to the ribozyme 
(catalytic RNA) from Tetrahymena thermophila. The fluorescence of these substrates 
increases up to 25-fold on binding to the ribozyme. Stopped-flow experiments provide 
evidence that pyr experiences at least three different microenvironments during the binding 
process. A minimal mechanism is presented in which substrate initially base pairs to 
ribozyme and subsequently forms tertiary contacts in an RNA folding step. All four mi- 
croscopic rate constants are measured for ribozyme binding of pyrCCUCU. 



in the catalytic core (10). A typical reac- 
tion modeling the second step of splicing is 

pyrCCUCU + p*UCGA ^ 
5'exon 3'splice L-21 

mimic junction Sea I 

pyrCCUCUA + p*UCG 
5'splice 3'end 
junction of intron 

Rapid mixing, stopped-flow experiments 



_ ~ ^[Sl + fc-i + kz + k-i (1) 



I fcl[S](fe 2 + fe-2) + fe-lfe-2 



(2) 



where fe^ fe_i, k lf ancl k-2 are the rate con- 
stants shown in Scheme I. Initial estimates 
of rate constants were obtained by fitting 
data in Fig. 3A to Eqs. 1 and 2. Rate 
constants were optimized by computer sim- 
ulation of traces (Table 1) (12). The value 



pyflbCUCl/n 




^.=^=130 nM 
^=3.9x1f/M- 1 S' 1 




i pv??V9V * / 



L-21 Sea I Ribozyme 



Base-paired 
Intermediate 



^=0.008 ^-1-1 "M 



Final complex 



B 



pyrC 



u6^^c^ ^ ^ yAGGAGGs 

Fig. 1 . (A) Minimal mechanism (Scheme I) consistent with all the data for all four 5' exon mimics. The 
rate constants are for pyrCCUCU. The sketch of L-21 Sea I is not meant to give structural detail but 
to indicate that pyr is protected from solvent in both the intermediate and final state and less so as 
oligomer length increases. The sketch also depicts GGAGGG (IGS) as not completely accessible 
in the unbound L-21 Sea I. Unes indicate base pairing (7). Known tertiary hydrogen bonds involving 
2' OH groups of substrate (5, 6) are indicated by bold dots. (B) Structure of the 5' end of 
pyr-modified oligomers (pyrC). (C) Minimal mechanism for pyrCUCU and pyrCCUCU binding to the 
IGS mimic GGAGGA. The rate constants are for pyrCCUCU. 
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^ ® Cloning and expression of microbial phytase. 



© A nucleotide sequence encoding phytase has been isolated and cloned. The coding sequence has been 
Inserted into an expression construct which in turn has been inserted into a vector capable of transforming a 
microbial expression host The transformed microbial hosts may be used to economically produce phytase on 
an industrial scale. The phytase produced via the present invention may be used in a variety of processes 
requiring the conversion of phytate to inositol and inorganic phosphate. 
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CLONING AND EXPRESSION OF MICROBIAL PHYTASE 



The present invention relates to the microbial production of phytase. 



Background of th© Invention 

Phosphorus is an essential element for the growth of all organisms. In livestock production, feed must 
be supplemented with inorganic phosphorus in order to obtain a good growth performance "of monogastric 
animals (e.g. pigs, poultry and fish). 

In contrast, nc inorganic phosphate needs to be added to* the feedstuffs of ruminant animals. 
Microorganisms, present in the rumen, produce enzymes which catalyze the conversion of phytate (myo- 
inositolhexakis-phosphate) to inositol and inorganic phosphate. 

Phytate occurs as a storage phosphorus source In virtually all feed substances originating from plants 
(for a review see; Phytic acid, chemistry end applications , E. Graf (ed.), Pilatus Press; Minneapolis, MN, 
U.SA (1986)). Phytitecbmprises 1-3% cf all nuts, cereals, legumes, oil seeds, spores and pollen. Complex 
salts of phytic acid are termed phytin. Phytic acid is considered to be an anti-nutritional factor since it 
chelates minerals such as calcium, zinc, magnesium, iron and may also react with proteins, thereby 
decreasing the bioavailability of protein and nutritionally important minerals. 

Phytate phosphorus passes through the gastro-intestina! tract of monogastric animals and is excreted in 
the manure. Though some hydrolysis of phytate does occur in the colon, the thus-released inorganic 
phosphorus has no nutritional value since inorganic phosphorus is absorbed only in the small intestine. As a 
consequence, a significant amount of the nutritionally important phosphorus is not used by monogastric 
animals, despite its presence in the feed. 

The excretion of phytate phosphorus in manure has further consequences. Intensive livestock produc- 
tion has increased enormously during the past decades. Consequently, the amount of manure produced has 
increased correspondingly and has caused environmental problems in various parts of the world. This is 
due. in part, tc the accumulation of phosphate from manure in surface waters which has caused 
eutrophication. 

The enzymes produced by microorganisms, that catalyze the conversion of phytate to inositol and 
inorganic phosphorus are broadly know* as phytases. Phytase producing microorganisms comprise 
bacteria such as Bacillus subtilis (V.K. Paver and V.J. Jagannathan (1962) J. Bacterid. 151^ . 1102-1108) 
and Pseudonomas (D.J. Coigrove (1970) Austral. J. Biol. Sci. 23 , 1207-1220); yeasts such as Sao 
charomyces cerevisiae (N.R. Nayini and P. Markakis (1984) Lebersmittei Wissenschaft und Technologie 17 
, 24-26); and fungi such as Aspergillus terreus (K. Yamads, Y. Minoda and S. Yamamoto (1986) Agric. Biol. 
Chem. 32 , 1275-1282). Various other Aspergillus species are known to produce phytase, of which, the 
phytase"produced by Aspergillus ficuum has been determined to possess one of the highest levels of 
specific activity, as well as having better thermostability than phytases produced by other microorganisms 
(unpublished observations). 

The concept of adding microbial phytase to the feedstuffs of monogastric animals has been previously 
described (Ware. J.H., Bluff, L and Shieh, T.R. (1967) U.S. Patent No. 3,297,548; Nelson, T.S.. Shieh. T.R., 
Wodzinski, R.J. and Ware, J.H. (1971) J. Nutrition W , 1289-1294). To date, however, application of this 
concept has r.ot been commercially feasible, due to the high cost of the production of the microbial 
enzymes (Y.W. Han (1989) Animal Feed Sci. & Techno!. 24 , 345-350). For economic reasons, inorganic 
phosphorus is still added to monogastric animal feedstuffs. 

Microbial phytases have found other Industrial uses as well. Exemplary of such utilities is an industrial 
process for the production of starch from cereals such as corn and wheat. Waste products comprising e.g. 
corn gluten feeds from such a wet milling process are sold as animal feed. During the steeping process 
phytase may be supplemented. Conditions (T - 50* C and pH = 5.5) are ideal for fungal phytases (see e.g. 
European Patent Application 0 321 004 to Alko Ltd.). Advantageously, animal feBds derived from the waste 
products of this process will contain phosphate instead of phytate. 

It has also been conceived that phytases may be used In soy processing (see FinaseTM Erreymes By 
Alko , a product information brochure published by Alko Ltd., Rajamfiki, Finland) ♦ Soybean meal contains 
R^Fi levels of the anti-nutritional factor phytate which renders this protein source unsuitable for application 
in baby food and feed for fish, calves and other non-ruminants. Enzymatic upgrading of this valuable protein 
source improves the nutritional and commercial value of this material. 



EP 0 420 368 A1 



Other researchers have become Interested In better characterizing various phytases and improving 
procedures for the production and use of these phytases. Ullah has published a procedure for the 
purification of phytase from wild-type Aspergillus f'cuum , as well as having determined several biochemical 
parameters of the product obtained by this purificiBon" procedure (Ullah. A. (1988a) Preparative Blochem. 
18 443-458). Pertinent data obtained by Ullah Is presented in Table 1. below. 

~ The amino acid sequence of the N-termlnus of the A. Scuum phytase protein has twice been disclosed 
by Ullah- Ullah, A. (1887) Enzyme and Engineering conference IX. October 4-8. 1887, Santa Barbara, 
California (poster presentation); and Ullah, A. (1988b) Prep. Blochem. 18 . 459-471. The amino acid 
sequence data obtained by Ullah is reproduced In Figure 1 A, sequence E. below. 

Several interesting observations may be made from the disclosures of Ullah. First of all, the purified 
preparation described in Ullah (1988a and 1988b) consists of two protein bands on SDS-PAGE We have 
found however, thet phytase purified from A. fjcuum contains a contaminant and that one of Ihe bands 
found on SDS-PAGE; identified by Ullah as a phytase. is originating from this c^amlnant. 

This difference is also apparent from the amino add sequencing data published by Ullah (1987. 1888b, 
compare Figure 1A. sequences A and B with sequence C). We have determined. In fact, that one of the 
amino acid sequences of Internal peptides of phytase described by Ullah (see figure 1B, sequence E) 
actually belongs to the contaminating 100 KDa protein (Figure 1C) which is present >rr ^ Preparafion 
obtained via the procedure as described by Ullah. and seen as one of the two bands on SDS-PAGE (Ullah 
1988a and 1988b). Ullah does not recognize the presence of such a contaminating protein, and instead 
identifies it as another form of phytase. The presence of such contamination, in turn, increases the difficulty 
in selecting and isolating the actual nucleotide sequence encoding phytase activity. Furthermore, the 
presence of the contamination lowers the specific activity value of the protein tested. 

Further regarding the sequence published by Ullah, it should be noted that the ammo ac.d residue > at 
position 12. has been disclosed by Ullah to be glycine. We have consistently found using protein and DNA 
sequencing techniques, that this residue Is not a glycine but is in fact a cysteine (see Figures 1 6 and 8). 

Finally Ullah discloses that phytase Is an 85 kDa protein, with a molecular weight after deglycosylation 
of 61 7 kDa (Ullah 1888b). This number, which is much lower than the earlier reported 76 kDa protein 
(Ullah A. and Gibson. D. (1988) Prep. Blochem. 17(1) . 63-91) was based on the relative amount of 
carbohydrates released by hydrolysis, and the apparent molecular weight of the native protein on SDS- 
PAGE. We have found, however, that glycosylated phytase has a single apparent molecular we.ght [of 85 
kDa. while the deglycosylated protein has an apparent molecular weight In the range of 48 ■ 56.5 kDa. 
depending on the degree of deglycosylation. 

Mullaney et al . (Filamentous Fungi Conference. AprD. 1987. Pacific Grove. California (poster presenta- 
tion) also disclbie the characterization of phytase from A. fjcuum . However, this report also contains 
mention of two protein bands on SDS-PAGE. one of 85 kDa, and one of 100 kDa, which were present .n the 
"purified" protein preparation. These protein bands are both identified by the authors as being forms of 
phytase. A method for transforming microbial hosts is proposed, but has not been reported. The cloning 
and isolation of the DNA sequence encoding phytase has not been described. 

It will be appreciated that an economical procedure for the production of phytase will be of significant 
benefit to, inter alia, the animal feed industry. One method of producing a more economical phytase would 
be to use recombinant DNA techniques to raise expression level* of the enzyme in various microorganisms 
known to produce high levels of expressed peptides or proteins. To date, however, the isolation and cloning 
of the DNA sequence encoding phytase activity has not been published. 



Summary of the Invention 



The present invention provides a purified and isolated DNA sequence coding for phytase. The isolation 
and cloning of this phytase encoding DNA sequence has been achieved via the use of specific 
oligonucleotide probes which ware developed especially for the present invention. Preferred DNA se- 
quences encoding phytases are obtainable from fungal sources, especially filamentous fungi of the genus 

Aspergillus . _ , . . , 

itlilnother object of the present invention to provide a vector containing an expression construct which 

further contains at least one copy of at least one. preferably homologous DNA sequence encoding phytase. 
operabty linked to an appropriate regulatory region capable of directing the high level expression of 
peptides or proteins having phytase activity in a suitable expression host 

The expression construct provided by the present invention may bo inserted into a vector, preferably a 
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plasrnld, which is capable of transforming a microbial host cell and integrating into the genome. 

It is a further object of the present invention tc provide a transformant. preferably, a microbial host 
which has been transformed by a vector as described In the preceding paragraph. The transformed hosts 
provided by the present invention are filamentous fungi of the genera Aspergillus , Trichoderma , Mucor 
and Penlcillium , yeasts of the genera Kluyveromyces and Saccharomyces or bacteria of the genus Bacillus 
. Especially preferred expression hosts are filamentous fungi of the genus Aspergillus . The transformed 
hosts are capable of producing high levels of recombinant phytase on an economical, industrial scate. 

In other aspects, the Invention Is directed to recombinant peptides and proteins having phytase activity 
In glycosylated or unglycosylated form; to a method for the production of said ungiycosylated peptides and 
proteins; to peptides and proteins having phytase activity which are free of impurities; and to monoclonal 
antibodies reactive with these recombinant or purified proteins. 

A comparison of the biochemical parameters of the purified wild-type X. ficuum phytase as obtained by 
Ullah, against the further purified wild-type A ficuum phytase, obtained via the present invention, is found in 
Table 1, below. Of particular note is the specific activity data wherein ft is shown that the purified protein 
which we have obtained has twice the specific activity of that which was pubfished by Uliah. 

The present invention further provides nucleotide sequences encocfing proteins exhibiting phytase 
activity, as well as amino acid sequences of these proteins. The secuences provided may be used to 
design oligonucleotide probes which may In turn be used in hybridization screening studies for the 
identification of phytase genes from other species, especially microbial species, which may be subse- 
quently Isolated and cloned. 

The sequences provided by the present Invention may also be used as starting materials for the 
construction of "second generation" phytases. "Second generation" phytases are phytases, altered by 
mutagenesis techniques {e.g. site-directed mutagenesis), which have properties that differ from those of 
wild-type phytases or recombinant phytases such as those produced by the present invention. For example, 
the temperature or pH optimum, specific activity or substrate affinity may be altered so as to be better 
.suited for application in a defined process. 

Within the context of the present Invention, the term phytase embraces a family of enzymes which 
catalyze reactions Involving the removal of inorganic phosphorous from various myoinositol phosphates. 

Phytase activity may be measured via a number of assays, the choice of which is net critical to the 
present invsption. For purposes of illustration, phytase activity may be determined by measuring the 
amount of enzyme which liberates Inorganic phosphorous from 1.5 mM sodium phytate at the rate of t 
umol/min at 37* C and at pH 5.50. 

It should be noted that the term "phytase* as recited throughout the text of this specification is intended 
to encompass all peptides and proteins having phytase activity. This point is illustrated in Rgure 1A which 
compares sequences A and 8 (sequences which have been obtained during the course of the present work) 
with sequence C {published by Ullah, 1988b). The Figure demonstrates mat proteins may be obtained via 
the present invention v/hich lack the first four amino acids (the protein of sequence A lacks the first seven 
amino acids) oi the mature A. ficuum phytase protein. These prcteins, however, retain phytase activity. The 
complete amino acid sequence of the phytase protein, as deduced from the corresponding nucleotide 
sequence, Is shown in Figure 8. 

Phytases produced via the present invention may be applied to a variety of processes which require the 
conversion of phytate to inositol and inorganic phosphate. 

For example, the production of phytases according to the present Invention will reduce production costs 
of microbial phytases in order to allow its economical application in animal feed which eventually will lead to 
an in vivo price/performance ratio competitive with inorganic phosphate. As a further benefit the phes- 
phorus"content of manure will be considerably decreased. 

It will be appreciated that the application of phytases, available at a price competitive with inorganic 
phosphate, will increase the degrees of freedom for the compound feed industry to produce a high quality 
feed. For example, when feed is supplemented with phytase, the addition of inorganic phosphate may be 
omitted and the contents of various materials containing phytate may be increased. 

In addition to use in animal feeds and soy processing as discussed above, the phytase obtained via the 
present invention may also be used in diverse industrial applications such as: 

- liquid feed for pigs 8nd poultry. It has become common practice to soak feed for several hours prior to 
feeding. During this period the enzyme will be able to convert phytate to inositol and inorganic phosphate; 

• an industrial process for the production of inositol or inositoi-phosphates from phytate; 

♦ other Industrial processes using substrates that contain phytate such as the starch industry and in 
fermentation industries, such as the brewing industry. Chelation of metal ions by phytate may cause these 
minerals to be unavailable for the production microorganisms. Enzymatic hydrolysis of phytate prevents 
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these problems. 4 . 

These and other objects end advantages of the present invention will become apparent from the 

following detailed description. 

Brief description of the Figures 

R8 A W N-termlnal amino acid sequences as determined for purified phytase. The amino acid sequences 
labeled A and B are provided by the present Invention, and originate from the phytase subforms with 
isoelectric points of 5.2 and 5.4, respectively. Sequence C Is cited from Ullah (1987, 1988b supra). 
The amino acid residue located at position 12 of sequences A and B has been determined by the 
present invention not to be a glycine residue, f denotes no unambigous identification. - denotes no 

is residue detected.] , ^ _ . . 

B N-terminal amino acid sequences of CNBr-cteaved internal phytase fragments. The ammo acid 
sequences labeled A and B (apparent molecular weight approximately 2.5 kDa and 36 ^ peptides, 
respectively) are provided by the present Invention. Sequences C through E are cted from Ullah 
(1988b, supra). 

C. N-terminal amino acid sequence of a 100 kDa protein which has been found by the present 
invention to be present in crude phytase samples. 

F ' 9 A. r Oligonucleotide probes designed on basis of the data from Figure 1 A, peptides A through E. 

B Oligonucleotide probes designed on the basis of the data from Figure 1B. peptides A and B. 
Rgure 3 Oligonucleotide probes used for the isolation of the gene encoding the acid-phosphatase. 
Figure 4. Restriction map of bacteriophage lambda AF201 containing the phytase locus of A. ficuum . 
The arrow indicates the position of the phytase gene and the direction of transcription. Clone # shows 
the subclones derived with indicated restriction enzymes from phage AF201 in pAN 8-1 (for pAF 28-1) 
ano in pUCl9 (for all other subclones). . 
Figure 5. Physical map of pAF 1-1. The 10 kb Bam HI fragment, inserted in pUC19, contains the entire 
gene encoding acid phosphatase from A. fteuum . . *,-„-» 

Figure 6. Compilation of the nucleotide sequences of plasmids pAF 2-3, pAF 2-8, and pAF 2-7 
encompassing the chromosomal phytase gene locus. The phytase coding region is located from 
nucleotide position 210 to position 1713; an intron is present In the chromosomal gene from nucleotide 
position 254 to position 355. Relevant features such as restriction sites, the phytase start and stop 
codons, and the Intron position are indicated. 

Rgure 7. Detailed physical map of the sequenced phytase chromosomal locus; the arrows indicate the 
location of the two exons of the phytase coding region. 

Figure 8 Nucleotide sequence of the translated region of the phytase cDNA fragment and the denved 
amino acid sequence of the phytase protein; the start of the mature phytase protein w indicated as 
position +i. The amino-terminus of the 38 kDa internal protein fragment la located at amino acid 
position 241 , whereas the 2.5 kDa protein fragment starts at amino acid position 380. 
Rgure 8. Physical map of the phytase expression cassette pAF 2-2S. Arrows indicate the erection of 
transcription of the genes. . 
Figure 10 IEF-PAGE evidence of the overexpression of phytase in an A. ficuum NRRL 3135 transior- 
mant Equal volumes of culture supernatant of A. ficuum (lane 1) and transformant pAF 2-2S SP7 (lane 
2) grown under identical conditions, were analysed on a Phast-System (Pharmacia) IEF-PAGE gel in the 
pH-ranoe of 4 5-6. For comparison, a sample of A. ficuum phytase. purified to homogeneity was included 
either separately (lane 4), or mixed with a culture supernatant (lane 3). The gels were either mined with 
a phosphatase stain described In the text (A), or with a general protein stain (Coomassie BnlRant Blue, 
B). The phytase bands are indicated by an asterisk. 

Figure 11 IEF-PAGE evidence for the overexpression of phytase in A. niger CBS 513.88 transformants. 
Equal volumes of culture supernatants of the A. niger parent strain (lane~iFor the transformants pAF 2- 
2S #8 (lane 2) pFYT3 #205 Cane 3) & #282 flane~4T were analysed by IEF- PAGE as described In the 
legend of Fig." 10. The gels were either stained by a general phosphatase activity stain (A) or by a 
general protein stain (B). Phytase bands are indicated by an asterisk. 

Figure 12. Physical map of pAB 8-1. The 14.5 kb Hin dill DNA insert m pUClQ contains the entire 
giucoamylase (AG) locus from A. niger . 
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Figure 13. A schematic view of the generation of AG promoter/phytase gene fusions by the polymerase 
chain reaction (PCR). The sequences of all oligonucleotide primers used are indicated in the text. 
Figure 14, Physical map of the phytase expression cassette pAF 2-2SH. 

Figure 15. Physical maps of the intermediate constructs pXXFYTl, pXXFYT2 and the phytase expression 
cassettes pXXFYT3. wherein XX Indicates the leader sequence <L). In p18FYT# and p24FYT#, respec- 
tively the 18 aa and the 24 aa AG leader sequence are inserted whereas in pFYT#, the phytase leader is 
used. 

Figure 16. Physical map of plasmid pFYT3AamdS. 
Figure 17. Physical map of plasmid pFYT3lNT. 

Figure 18. Physical map of the phytase/AG replacement vector pREPFYT3. 

Figure 18. Autoradicgraphs of chromosomal DMA, digested with Pvu II (A) and Bam HI (B) and 
hybridized with the 22 P-labeled A. ficuum phytase cDNA as prcbe of tRTmlcrobial species S. cerevisiae 
(lane 2); B. subtilie (lane 3); K. ladiTfllne 4); P. crysogenum (lane 5); P. aeruginosa (lane 6); S. lividans 
(lane 7); A. rffgefT ug (lane 8); Qger 5 ug (lane 9); blank (lane 10); ( Tthermocellum (lane 11). Une 1: 
marker DNA. 

Detailed Description of the invention 



The cloning of the genes encoding selected proteins produced by a microorganism can be achieved in 
various ways. One method is by purification of the protein of interest subsequent determination of its N- 
terminal amino acid sequence and screening of a genomic library of said micro-organism using a DNA 
oligonucleotide probe based on said N-terminal amino acid sequence. Examples of the successful 
application of this procedure are the cloning of the Isopenicillin N-synthetase gene from Cephalosporin 
acremonium (S.M. Samson et al . (1985) Nature 318 . 191-194) and the isolation of the gene encoding the 
TAKA amylase for Aspergillus oryzae (Boel et al . (1986) EP-A-0238023). 

Using tnis procedure, an attempt has been made to isolate the Aspergillus ficuum gens encoding 
phytase. The protein has been purified extensively, and several biochemical parameters have been 
determined. The data obtained have been compared to the data pubSshed by Ullah (1988a). Both sets of 
data are given in Table 1, below. 
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Table 1 



ft flftrhflmieal paramet ers of pyrjfff ad wtld-tvoc A. — £i£UHJa 



Parameter 


present invention 


Til 1 ah 


Specific activity* 




50 u/Bici orotein 


Purity: SDS-PAGE 


85 kDa 


oj / XUU JuJa 


: IEF-PAGE 


3 or 4 bands 


not done 


Km (Affinity constant) 


250 fM 


40 MM 


Specificity for: 






Inositol-i-P 


not active 


not active 


Inositol-2-P 


Km - 3.3mM 


5% activity 


pH optimum 


2.5 and 5.5 


2.5 and 5.5 


Temp, optimum (*C) 


50 


58 


MW (kDa)** 


85 


85 and 100 


MW (unglycosylated) ** 


56*5 


61.7 


isoelectric Point*** 


5.0-5.4 


4.5 



* Phytase activity is measured by Ullah at 58 'C rather 

than at 37 *C. A unit of phytase activity is defined as that 
amount of enzyme which liberates inorganic phosphorus from 
1.5 mM sodium phytate at the rate of 1 /mol/min at 37 'C and 
at pH 5.50. To compare the fermentation yields and the 
specific activities, the activities disclosed by Ullah were 
corrected for the temperature difference. The correction is 
based on the difference in phytase activity measured at 37 «C 
and at 58'C as shown in Table III of Ullah (1988b). 

** Apparent Molecular Weight as detenained by SDS-PAGE. 



*** As determined by IEF-PAGE 

In order to isolate the gene encoding phytase, a first set of oligonucleotide probes was designed 
according to the above-dsscribed method (Fig. 2A). The design of these probes was based on the ammo 
acid sequence data. As a control for the entire procedure, similar steps were taken to isolate the gene 
encoding acid-phosphatase, thereby using the protein data published by Uilah and Cummins ((1887) Prep. 
Biochem. 17 . 397-422). For acid-phosphatase, the corresponding gene has been isolated without difficul- 
ties. Howevir. for phytase. the situation appeared to be different. Despite many attempts in which probes 
derived from the N-terminal amino acid sequence were used, no genomic ONA fragments or clones from 
the genomic library could be Isolawd which could be positively identified to encompass the gene encoding 

phytase. , „ 

To overcome this problem, the purified phytase was subjected to CNBr-drected cleavage and the 
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resulting protein fragments were Isolated. The N-termlnal amino acid sequences of these fragments were 
determined (Fig. IB), and new oligonucleotide probes were designed, based on the new data (Rg. 2B). 
Surprisingly, the new oligonucleotide probes did identify specific DNA fragments and were suited to 
unambiguously identify clones from a genomic library. No cross hybridization was observed between the 
new clones cr DNA fragments isolated there from, and the first set of oligonucleotide probes cr the clones 
isolated using the first set of probes. 

It will be appreciated that this second set of probes may also be used to Identify the coding sequences 

of related phytases. 

The newly isolated clones were used as probes In Northern blot hybrldl2ations. A discrete mRNA could 
only be detected when the mRNA was isolated from phytase producing mycelium. When RNA from non- 
phytase producing mycelium was attempted, no hybridization signal was found. The mRNA has a size of 
about 1800 b, theoretically yielding a protein having a maximal molecular weight of about 80 kDa- This 
value corresponds to the molecular weight which has been determined for the non-glycosylated protein, and 
the molecular weight of the protein as deduced from the DNA sequence. 

Mcreover, when introduced into a fungal cell by transformation, an Increase In phytase activity could be 
demonstrated. This indicates conclusively that the nucleotide sequence encoding phytase has indeed been 
isolated. The amino acid sequences which have been determined for the purified phytase enzyme, and for 
the ONBr fragments obtained therefrom, concur with the amino acid sequence deduced from the sequence 
which was determined for the cloned gene. The nucleotide sequence and the deduced amino acid 
sequence are given In Figures 6 and 8, and further illustrate the cloned sequence encoding phytase. 

The isolation of the nucleotide sequence encoding phytase enables the economical production of 
phytase on an industrial scale, via the application of modem recombinant DNA techniques such as gene 
amplification, the exchange of regulatory elements such as e.g. promoters, secretional signals, or combina- 
tions thereof. 

Accordingly, the present invention also comprises a transformed expression host capable of ttie efficient 
expression cf high levels of peptides or proteins having phytase activity and, if desired, the efficient 
expression of acid phosphatases as well. Expression hosts of interest are filamentous fungi selected from 
the genera Aspergillus , Trichoderma , Mucor and Penlclilium , yeasts selected from the genera 
Kiuyveromyces and Saccharomyces and bacteria of the genus Bacillus . Preferably, an expression host is 
selected which is capable of the efficient secretion of their endogenous proteins. 

Of particular interest are Industrial strains of Aspergillus , especially niger , ficuum , awamori or oryzae . 
Alternatively, Trichoderma reesel , Mucor miehei , Kiuyveromyces lactls . Saccharomyces cerevisiae , 
Bacillus subtills or Bacillus lichenlformis may be used. 

The expression construct will comprise the nucleotide sequences encoding the desired enzyme product 
to be expressed, usually having a signal sequence which is functional In the host and provides for secretion 
of the product peptide or protein. 

Various signal sequences may be used according to the present invention. A signal sequence which is 
homologous to the cloned nucleotide sequence to be expressed may be used. Alternatively, a signal 
sequence which is homologous or substantially homologous with the signal sequence of a gene at the 
target locus of the host may be used to facilitate homologous recombination. Furthermore, signal sequences 
which have been designed to provide for Improved secretion from the selected expression host may also be 
used. For example, see Von Heyne (1983) Eur. J. Biochem. V33 , 17-21; and Perlman and Halverson (1983) 
J. McL Biol. 187 , 391-409. The DNA sequence encoding the signal sequence may be joined directly 
through the sequence encoding the processing signal (cleavage recognition site) to the sequence encoding 
the desired protein, or through a short bridge, usually fewer than ten codcns. 

Preferred secretional signal sequences to be used within the scope of the present Invention are the 
signal sequence homologous to the cloned nucleotide sequence to be expressed, the 18 amino acid 
glucoamylase (AG) signal sequence and the 24 amino add glucoamylase (A6) signal sequence, the latter 
two being either homologous or heterologous to the nucleotide sequence to be expressed. 

The expression product, or nucleotide sequence of Interest may be DNA which is homologous or 
heterologous to the expression host 

"Homologous" DNA is herein defined as DNA originating from the same genus. For example, 
Aspergillus is transformed with DNA from Aspergillus . In this way it is possible to Improve already existing 
properties of the fungal genus without Introducing new properties, which were not present In the genus 
before. 

"Heterologous" DNA is defined as DNA originating from more than one genus, I.e., as follows from the 
example given in the preceding paragraph, DNA originating from a genus other than Aspergillus . which is 
then expressed in Aspergillus . 
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Nucleotide sequences encoding phytase activity are preferably obtained from a fungal source. More 
preferred are phytase encoding nucleotide sequences obtained from the genus Aspergillus . Most preferred 
seauences are obtained from the species Aspergillus fiojum or Aspergillus niger . 

The region 5' to the open reading frame in the nucleotide sequence of interest will comprise the 
transcriptional initiation regulatory region (or promoter). Any region functional in the host may be employed, 
including the promoter which is homologous to the phytase-encoding nucleotide sequence to be expressed 
However, for the most part the region which is employed will be homologous with the reg.on of the target 
locus This has the effect of substituting the expression product of the target locus with the expression 
product of interest. To the extent that the level of expression and secretion of the target locus encoded 
protein provides for efficient production, this transcription initiation regulatory region will normally be found 
to be satisfactory. However, in some instances, one may wish a higher level of transcription than the target 
locus gene or one may wish to have inducible expression employing a particular inducing agent. In those 
instances, a transcriptional initiation regulatory region will be employed which Is different from the regton In 
the target locus gene. A large number of transcriptional initiation regulatory regions are known which are 
functional in filamentous fungi. These regions include those from genes encoding Jteoniytom 
fungal amytese. acid phosphata se, QAPDH. Trp C. Amd S. Ata A, Aid A, hlstone H2A. Pyr 4. Pyr G. 
isopenicillin N synthetase. PGK. acid protease, acyl transferase, and the like. 

The target locus will preferably encode a highly expressed protein gene, i.e.. a gene whose expression 
product is expressed to a concentration of at least about 0.1 g/l at the end of the fermenteton process. The 
duration of this process may vary inter alia on the protein product desired. As ar. example of such a gene 
the gene encoding glucoemylase (AW illustrative. Other genes of interest Include fungal ,-amylas* aad 
phosphatase, protease, acid protease, lipase, phytase and cellobiohydrolase. Especlaily preferred target loa 
are the glucoamylase gene of A . niger . the fungal amylase gene of A . oryzas . the cellcbKhydrolase 
genes of T . reesei . the acid prot*a"iilene of Mucor miehei . the lactase gene of Kluyveromyces lacts or 
the invertase gene of Saccharomyces cerevisiae . 

The transcriptional termination regTrtatoryligion may be from the gene of interest, the target locus or 
any other convenient sequence. Where the construct includes further sequences of Interest downstream (,n 
the direction of transcription) from the gene of interest, the transcriptional termination regulatory reg»n. If 
homologous with the target locus, should be substantially smaller than the homologous flanking reg.on. 

A selection marker is usually employed, which may be part of the expression construct or separate 
from the expression construct, so that it may integrate at a site different from the gene of interest. Since the 
recombinant molecules of the invention are preferably transformed to a host strain that can be used for 
industrial production, selection markers to monitor the transformation are preferably dominant seechon 
markers i.e., no mutations have to be Introduced into the host strain to be able to use tnese select™ 
markers Examples of these are markers that enable transformants to grow on defined nutrient sources (e.g. 
the A nidulans amd S gene enables A. niger transformants to grow on acetamide as the sole nitrogen 
source) bTTnlrTers-Fat confer re sistance~toir¥biolics (e. g., the bte gene confers resistance to phleomycn 
or the hph gene confers resistance to hygromycin B). 

TheTelection gene will have its own transcriptional and translational initiation and termination regulatory 
regions to allow for independent expression of the marker. A large number of transcriptional initiation 
regulatory regions are known as described previously and may be used in conjunction *ith the marker 
gone Where antibiotic resistance is employed, the concentration of the antibiotic tor seiecbon will vary 
depending upon the antibiotic, generally ranging from about 30 to 300 ag/ml of the antibiotic. 

The various sequences may be Joined In accordance with known techniques, such as restoration, joining 
complementary restriction sites and ligatJng. blunt ending by filling in overhangs and blunt ligation. Bal 31 
resection, primer repair, in vitro mutagenesis, or the like. Poiylinkers end adapters may be employed, when 
appropriate, and introduced or removed by known techniques to allow for ease of assembly of the 
expression construct At each stage of the synthesis of the construct, the fragment may be cloned, 
analyzed by restriction enzyme, sequencing or hybridization, or the like. A large number of vectors are 
available for cloning and the particular choice is not critical to this invention. Normally, cloning will occur in 

~ ^he flanking regions may include at least part of the open reading frame of the target locus, particularly 
the slgnai sequence, the regulatory regions 5 and 3 of the gene of the target locus, or may extend beyond 
the regulatory regions. Normally, a flanking region will be at least 100 bp. usually at least 200 bp. and may 
be 500 bp or more. The flanking regions are selected, so as to disrupt the target gene and prevent its 
expression. This can be achieved by inserting the expression cassette (comprising the nucleotide sequence 
to be expressed and optionally Including additional elements such as a signal sequence, a transcriptional 
initiation regulatory region sequence and/or a transcriptional termination regulatory region sequence) into 
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the open reading frame proximal to the 5 region, by substituting all or a portion of the target gene with the 
expression construct, or by having the expression construct intervene between the transcriptional initiation 
regulatory region at the target locus and the open reading frame. As already indicated, where the 
termination regulatory region is homologous with the region at the target iccvs, the 3 -flanking region should 

5 be substantially larger than a termination regulatory region present in the construct 

The present Invention also provides the starting material for the construction of 'second-generation 1 
phytases, i.e. phytase enzymes with properties that differ from those of the enzyme isolated herein. 
Second-generation phytases may have a changed temperature or pH optimum, a changed specific activity 
or affinity for its substrates, Or any other changed quality thai makes the enzyme more suited for application 

io .n a defined process. E. coli is the best host for such mutagenesis (e. g. site-directed mutagenesis). Since 
E. coli lacks the splicing machinery for the removal of introns which might be present in the phytase gene, 
TcBFlA clone of phytase Is the sequence of choice to be expressed in EL coli . This cDNA sequence can 
be readily mutated by procedures well known in the art, after which the mutated gene may be introduced 
into the desired expression constructs. 

ra The construct may be transformed into the host as the cloning vector, either linear or circular, or may 
be removed from the cloning vector as desired. The cloning vector is preferably a plasmld. The plasrnid will 
usually be linearized within about 1 kbp of the gene of interest Preferably, the expression construct for the 
production of the phytases of the present invention will be Integrated into the genome of the selected 
expression host 

20 A variety of techniques exist for transformafion of filamentous fuigi. These techniques include 
protoplast fusion or transformation, electroporation and micro-projectile firing into cells. Protoplast trans- 
formation has been found tc be successful and may be bed with advantage. 

Mycelium of the fungal strain of interest is first converted to protopass by enzymatic digestion of the 
cell wall in the presence of an osmotic stabilizer such as KCI or sorbitol. DNA uptake by the protoplasts Is 
is aided by the addition of CaCi 2 and a concentrated solution of polyethylene glycol, the latter substance 
causing aggregation of the protoplasts, by which process the transforming DNA is Included in the 
aggregates and taken up by the protoplasts. Protoplasts are subsequently allowed to regenerate on solid 
medium, containing an osmotic stabilizer and, when appropriate, a selective agent, for which the resistance 
is encoded by the transforming DNA. 
30 After selecting for transformants, the presence or the gene of Interest may be determined in a variety of 
ways. By employing antibodies, where the expression product Is heterologous to the host, one can detect 
the presence of expression of the gene of interest. Alternatively, one may use Southern or Northern biots to 
detect the presence of the integrated gene or Its transcription product 

Amplification of the nucleotide sequence or expression construct of interest may be achieved via 
35 standard techniques such as, the introduction of multiple copies of the construct In the transforming vector 
or the use of the amd S gene as a selective marker (e.g. Weinans et al . (1985) Current Genetics, 9 . 361- 
368). The DNA s^uehce to be amplified may comprise DNA which Is either homologous or heterologous to 
the expression host, as discussed above. 

The cells may then be grown in a convenient nutrient medium. Low concentrations of a protease 
40 inhibitor may be employed, such as phenylmethyisulfonyl fluoride, a2-macro-globulins, pepstatin, or the 
like. Usually, the concentration will be in the range of about 1 ug/ml to 1 mg/ml. The protease gene(s) may 
be inactivated in order to avoid or reduce degradation of the desired protein. 

The transformants may be grown in either batch or continuous fermentation reactors, where the nutrient 
medium is isolated and the desired product extracted. 
45 Various methods for purifying the product, if necessary, may be employed, such as chromatography (e. 
g., HPLC), solvent-solvent extraction, electrophoresis, combinations thereof, or the like. 

The present invention also provides a downstream processing method in which the fermentation broth 
(optionally purified) is filtered, followed by a second germ-free filtration, after which the filtered solution is 
concentrated. The thus-obtained liquid concentrate may be used as follows: 
50 a) Phytase and other proteins may be precipitated from the liquid concentrate by adding acetone to a 
final volume of 60% (v/v) under continuous stirring. The precipitate may be dried In a vacuum at 35 C. 
After grinding the dry powder, the enzyme product may be used as such for application experiments. 
Recovery yields are about 90%. 

b) The liquid concentrate may be spray-dried using conventional spray-drying techniques. Recovery 
55 yields vary from 80 to 99%. 

c) The liquid concentrate may be mixed with carrier materials such as wheat bran. The thus-obtained 
mixture may be dried in a spray tower or in a fliid bed. 

d) The liquid concentrate may be osmotically stabilized by the addition of e.g. sorbitol. A preservative 
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such as benzoic acid may be added to prevent microbial contamination. 

All four formulations may be sold to premlx manufacturers, compound feed industries, other distributors 

^irex'amples herein are given by way of Illustration and are in no way intended 
the present invention. It will be obvious to those skilled in the art that the phytase gene ofthe mMM« 
be used in heterologous hybridization experiments, directed to the isolat.cn of phytase encoding genes from 
other micro-organism3. 

Example 1 



Fermentation of A. ficuum NRRL 3135 

Aspergillus ficuum strain NRRL 3136 was obtained from the Northern Region Research Lab. USDA 
iSlfSSivS-Street, Peoria. Illinois, USA. Fungal spore preparations were made fotiowmg standard 

^Sres' and subsequently cells were transferred through a series of batch fermentations in ' Erlenmeyer 
flasks to a 10 I fermentor. After growth in batch culture, the contents of th,s fermentoT were used as 
inftojlum for a final 500 liter batch fermentation. 

t ^a ld contains: 91 fl/l corn starch (BDH Chemicals Ltd.); 38 
MgS0..7H,0; 0.6 g/l KCI; 0.2 g/1 FeSO*.7H,0 and 12 g/l KN0 3 . The pH was maintained at 4.6 * 0.3 by 
automatic titration with either 4N NaOH or 4N H ? SO«. 

Ceils were grown at 28* C at an automatically controlled dissolved oxygen concentration of 25 ^ air 
saturation. Phytase production reached a maximum level of 5-10 U/ml after 10 days of fermentation. 

Example 2 



30 
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Purification and characterization of A. ficuum phytase 



A, Phytase activity assay 

100 ul or broth filtrate (diluted when necessary) or supernatant or 100 ill of demiwater as reference are 
added to an incubation mixture having the following composition: 
4c - 0.25 M sodium acetate buffer pH 5.5, or 

- glycine HCL-buffer pH 2.5 

- 1mM phytic acid, sodium salt 

" ^St^ting Sure is incubated for 30 minutes at 37* C. The reaction is stopped by the addition of 1 
.5 ml of 10% TCA (trichloroacetic add). After the reaction has terminated. 2 ml of reagent (3.66 g of 

FeSO^ in 50 ml of ammonium molybdate solution (2.5 g (NH.) e Mo 7 0,*'4H 2 0 and 8 ml H 2 SO.. 

diluted up tc 250 ml with demiwater)) Is added. 

The intensity of the blue color is measured spectro-photometrically at 750 nm. The measurements are 

indicative of the quantity of phosphate released in relation to a calibration curve of phosphate in the range 
so of 0 - 1 mMol/1. 



Phosphatase stain 



55 



Components with phospatase activity were detected by isoelectric focusing using a general 
ohosphatose stain. The gel was incubated with a solution of o-naphthylphosphate and Fast Garnet GBC salt 
E To &T2*, (wXespectively) In 0.8 M sodium acetate buffer pH 5.5. The reaction, which results In 
the "appearance" of a black precipitate, was either terminated with methanol:acetic acid (30:10 %. v/v). or. 
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should the protein having phytas© activity be further required, by rinsing with distilled water. 

6. Purification of A. ftcuum phytase 
5 ~~ 

Phytase was purified to homogeneity from the cufture broth of A. ficuum NRRL 3135. The broth was 
first made germ-free by filtration. The resulting culture filtrate was sub sequently further concentrated in a 
Flltron ultrafiltration unit with 30 kD cutoff filters. The pH and ionic strength of the sample were adjusted for 
the purification procedure by washing the sample with 10 mM sodium acetate buffer pH 4,5. The final 

to concentration in this ultrafiltration procedure was approximately 20 fold. 

The sampie was then applied to a cation exchanger (S-Sepharose Fast-Flow in a HR 18/10 20 ml 
column, both obtained from Pharmacia) in a Waters Preparative 650 Advanced Protein Purification System. 
The proteins bound were eluted with a sodium chloride gradient from 0*1 M in the sodium acetate buffer. 
Phytase eluted at approximately 250 mM NaCI. Phytase activity containing fractions were pooled, con- 
re centrated and desalted by ultrafiltration. The resulting solution was applied to an anion exchanger (O- 
Sepharose Fast-Flow in a HR 16/10 20 ml oolumn, Pharmacia), and the proteins were again eluted by a 
sodium chloride gradient from 0-1 M in the acetate buffer described above. Phytase was eluted from this 
column at approximately 200 mM NaCI. 

The result of these purification steps Is a partially purified phytase preparation with a specific activity of 

20 approximately 40-50 U/mg protein, Indicating a 25-fold purification. 

Analysis of the purity of the partially purified phytase Indicated the presence of a major impurity with a 
molecular weight of approximately 100 kDa (Fig. 1B, sequence E). Isoelectric focusing indicated the 
presence of a number of phosphatase activity containing enzymes, including 3-4 phytase subforms 
(Isoelectric points varying from 5.0-5.4) (Fig 1 A, sequences A and B). 

25 In order to obtain a homogeneous phytase preparation, a further two-fold purification was achieved by a 
subsequent separation of the components of the partially purified phytase by isoelectric focusing in a LKB 
Multiphor system on Ampholine PA6 plates (pH range 4-6.5). The proteins with phosphatase activity 
(including the phytase) were detected by the general phosphatase staining procedure described above. The 
bands of interest were subsequently excised from the gel and the active protein was eluted by a 18 hr 

30 incubation of the gel slices In 10 mM sodium acetate buffer 5.5. The protein fractions were analysed In the 
specific phytase activity assay, as described in Example 2, thus discriminating the phytase fractions from 
other acid phosphatases. The final purification factor for phytase was approximately 60 bid (specific activity 
of final preparation 100 U/mg protein). In this final purification step it was also possible to isolate different 
subforms of phytase (Fig. 1A, sequences A and B). 

35 Monoclonal antibodies directed against the A. ficuum phytase were prepared, providing an effective 
purification procedure. The antibody was coupled to cyanogen bromide-activated Sepharose 4B (5 mg/mf 
gel), and this matrix was used in a immunoaffinity column. The matrix was shown to bind approximately 1 
mg phytase per ml. The phytase couid be eluted from the affinity column with a pH 25 buffer (100 mM 
glycine-HCI, 500 mM NaCI) without any loss of activity. This procedure can be used to isolate homo- 

40 geneous phytase from a crude culture filtrate In one single step with an 80% recovery and a 60-fold 
purification. 



45 



C. Degiycosylation of phytase 



A. ficuum phytase (70 wg protein) was incubated with 2.6 U N-Qlycanase (Genzyme) in 0.2 M sodium 
phosphatTbuffer pH 8.6 and 10 mM 1,10-phenanthroHne In a total volume of 30 til. 

After 16 hrs at 37* C, the extent of degiycosylation was checked by electrophoresis (Phast System, 
Pharmacia). The apparent molecular weight of the phytase was found to decrease from 85 kDa to 
6C approximately 56.5 kDa. The periodic acid Schiff (PAS) sugar staining, which identifies native phytase as a 
glycoprotein, failed to detect any residual carbohydrates attached to the protein. The complete removal of 
carbohydrate was further substantiated by the sensitive lectln-blotting method. Native and deglycosylated 
phytase (both 1.5 ug) were run on a standard SDS-PAGE gel and electrophoreticaily transferred to a PVDF 
membrane (Immobilon, Mlllipore) In 25 mM TRIS*giycine buffer pH 8.3, 20% (v/v) methanol, fa a period of 
55 16 hrs at 30V. 

The membrane was subsequently incubated with 1% (wfr) bovine serum albumin in phospate buffered 
saline and incubated with concanavaiin A-peroxidase (Sigma, 10 ug.'ml in phosphate buffered saline). The 
peroxidase v/as then stained with 4-chIoro-1-naphthol (Sigma). 
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This sensitive method also failed to detect any residual carbohydrafc attached to the deglycosylated 

^Afte'r deglyoosylatlon, phytase has completely lost Its activity, possibly due to aggregation of the 
enzyme. 

Example 3 
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Dejem^nation of the amino acid sequence of phytase and design of oligonucleotide probes 

A. Determination of the N-termlnal amino acid sequence 

Phytase was electrophoreses transferred from SDS-PAGE or from IEF-PAGE onto a PVDF blotting 
JSml (Immobilon. Millies). Bectroblotting was performed In 10 ^^^"J^ 
Drooanesulfonic acid) buffer pH 11.0. with 10% <v/v) methanol, tor a penod of 16 hrs. at 30V and U. 

protein was located with Coomassie Brilliarrt Blue staining. The band of .nterest was excised 
further detained in methanol and subjected to gas-phase sequencing. The procedure has been earned out 
several times, using several individual preparations. The results obtained are g.ven .n Figure 1 A (sequences 

A ^hfamlno acid sequence has also been determined tor a 100 kDa protein that was present in crude 
preparations. The data obtained for this protein are given In Figure 1C. This sequence shows co naderable 
homology with the acid phosphatase that has been Isolated from Aspergillus niger (MacRae et al . (1988) 
Gene 71_ , 339-348). 

B. Determination of internal amino acid sequences 

Protein fragmentation by cyanogen bromide 

Phytase purified to homogeneity, was transferred into 100 mM NaHCOs by ultrafiltration 
(Mlcroconcentrator Centricon 30. Amlcon). The protein was subsequently lyoph. zed. d.ssolved I .n 70. o 
Mluoroacetic acid (v/v). and incubated for 8 hr with an approximately 300-fold molar excess of CNBr. Tne 
reaction was terminated by dilution of the mixture with water. The resulting fragments again 
Ivoohllized. The sample was then dissolved in SDS-PAGE sample buffer containing DTT (drthiothreitol). and 
the extent of fragmentation was determined by PAGE. Analytical PAGE was performed on a Pharmacia 
Phast-System unit, on 20% SDS-PAGE gels. The gels were prerun to create a continuous buffer system to 
improve the separation of the small peptides (according to the manual). Peptides wens detected using a 
sllver-staining technique known In the art. since Coomassie Brilliant Blue failed to detect the smallest 
peptide. The result of the procedure was a complete degradation of phytase into peptides with molecular 
weiohts of < 2 5 kDa, 36 kDa, 57 kDa and 80 kDa. 
45 The peptides were isolated for gas-phase sequencing by SDS-Tricine-PAGE as described by Schagger 
& jagow (1987) Anal. Blochem. 166 . 36*379 followed by electroblotting as described above. 

The N-terminus of the 57 kDTfragment Is identical to the N-terminus of phytase as determmed by Ullah 
(19B8b supra), with the exception of the first four amino acids which are absent (Figure 1A, sequence o). 
The N-terminal sequences of the 2.5 kDa and 38 kDa peptides are shown in Figure 1B as sequences A and 
so B. 

C. Oligonucleotide probes 

ss oiloonudeatlde probes have been designed, based on the amino add sequences given in Figure 1A 
and IB, and were prepared using an Applied Blosystems ABI 380B DNA synthesizer. These 
oligonucleotides are given In Figure 2A and 2B. 
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Example 4 



Hybridization of genomic blots and genomic libraries with a first set of oligonucleotide probes 



Genomic DNA from A. ficuum has been isolated by grinding the mycelium in liquid nitrogen, using 
standard procedures (e.g7Yeltor et al (1884) Proc. Natl. Acad. Sci.-U.SA, 1470-1474). A genomic library 
was constructed in the bacteriophage vector lambda EMBL3, using a partial Sau 3A digest of A. ficuum 
NRRL 3135 chromosomal DNA, according to standard techniques (e.g. Maniatls et at . (1982) Molecular 
cloning, a laboratory manual , Cold Spring Harbor Laboratory, New York). The thus-obtained genomic library 
contained 60 to 70 times the K ficuum genome. The library was checked for the occurrence of plaques 
without insert by hybridization whhlReTambda EMBL3 stuffer fragment. Less than 1% of the plaques were 
observed to hybridize to the lambda EMBL3 probe. The insert size was 13 to 17 kb. 

To identify conditions and probes that were suited for the screening of the genomic fibrary, genomic 
DNA was digested with several restriction enzymes, separated on agarose gels and blotted onto 
Genescreen plus, using the manufacturers instructions. The blots were hybridized with all oligonucleotide 
probes. Hybridization was performed usings conditions of varying stringency (6 x SSC. 40 to 60 * C for the 
hybridization; up to 0.2 x SSC, 66* C for the washing). Probes 1068 and 1024 (Figure 2A) were selected for 
the screening of the genomic library, afthough no common DNA fragments could be identified that 
hybridized specifically with both probes, Acid-phosphatase probe 1025 (Figure 3) gave a specific and 
discrete hybridization signal and hence this probe was selected for screening the genomic library for the 
acid phosphatase gene. 

Using all three probes, hybridizing plaques could be Identified in the genomic library. The hybridization 
signal corresponding to probe 1025 (acid phosphatase) was strong and reproducible. Hybridization signals 
of variable intensity were observed using probes 1024 and 1068 (phytase). No cross hybridization between 
the two series was observed. All three series of plaques were rescreened and DNA was Isolated from eight 
single, positive hybridizing plaques (Maniatis et al supra). In each series, denes that contained identical 
hybridizing fragments could be identified, indicating that the inserts of said clones are related and probably 
overlap the same genomic DNA region. Again, no cross-hybridization could be demonstrated using the two 
phytase specific series (probes 1024 and 1068). Indicating that, although both probes used to isolate the 
two series of clones were obtained from the N-terminal amino acid sequence of the protein, different 
genomic DNA fragments had been identified and cloned. 

Ail three series of clones were hybridized with Northern blots containing mRNA isolated from induced 
and non-induced mycelium (Example 6). The acid phosphatase-speciflc clones, as well as the isolated 
internal 3.1 kb Sail fragment from these clones, hybridized exclusively to induced mRNA samples. The 
mRNA identified by the acid phosphatase-speciflc probes is about 1800 b in length, which agrees with the 
known size of the protein (68 kDa. Ullah and Cummins (1987) Prep. Biochem. 17 . 397-422). No 
hybridization of the phytase-speeffle clones with specific mRNA's could be demonstrated. We have thus 
concluded that the above-described method was unsuccessful In cloning the gene encoding phytase. It may 
be further concluded that this failure Is not due to a failure in the method used, since the method has been 
successfully applied to identify the gene encoding acid phosphatase. The lambda clone containing the acid 
phosphatase gen9 was deposited on April 24, 1989 at the Centraal Bureau voor Schimmelcultures, Baam, 
The Netherlands and has been assigned accession number CBS 214.89. A 10 kb Bam HI fragment has 
been isolated from phage 21 and subcloned Into pUCl9. This subclone contains the entire gene encoding 
acid phosphatase. The subclone, pAF 1-1 (Figure 5) was deposited on April 24, 1989 as CBS 213.89. 



Example 5 



Isolation of the gene encoding phytase, using a second set of oligonucleotide probes 

Probes have been designed using the N-temnlnal amino acid sequence of CNBr-generated fragments 
(Figure 2B, probes 1295, 1296 and 1297) and have been hybridized with genomic DNA as described above. 
The feasibility of using these probes in the isolation of the gene encoding phytase was again studied by 
Southern hybridization of genomic blots with the probes. This time, hybridizing fragments of corresponding 
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lengths could be identified, using all three probes, despite the fact that the probes have been derived from 
non-overlapping regions. No hybridization was found between the new set of probes and the clones that 
have been isolated using the first set of probes {Example 4). Therefore, the genomic library was rescreened 
using all three probes in separate experiments. A subset of the clones (lambda AF201, 219. 241 and 243) 

s isolated with each individual probe also hybridized with both other probes, indicating that In this case, using 
the three different probes, clones were isolated from a single genomic region. Attempts were made to 
hybridize the newly isolated clones with probes 1024 and 1068. In both cases, no hybridization with the 
newry Isolated clones was observed under conditions in which both probes had successfully hybndlzed to 
the clones which were isolated using these probes (see Example 4). This demonstrates that the newly 

io isolated clcnes have no homology to the probes derived from the N-terminus of the purified phytase. 

A lambda EMBL3-clcne, which hybridizes to all three probes (1285-1297). was named lambda AF201 
(Figure 4) and was deposited on March 9, 1989 as CBS 1 55.89. 

A 5 1 kb Bam HI fragment of lambda AF201 (subcloned in P UC19 and designated pAF 2-3, see Figure 
4) hybridzingloall three oligonucleotide probes, was used to probe a Northern Wot In this case, a discrete 

f5 mRNA having a size of 1800 bases was identified. This mRNA was found only in induced mycelium. Similar 
results were obtained when the oligonucleotides were used as probes. Therefore, using the new sot of 
orobes a common DNA fragment has been Identified, which hybridizes specifically to an induced mRNA. 
The length of this mRNA (1800 b) Is sufficient to encode a protein of about 60 kDa. which is about the s.ze 
of'the ncn-giycosylated protein. Clearly, the isolated fragments contain at least part of the gene encoding 

20 phytase. 

Example 6 
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Isolation of "induced" and "non-Induced* mRNA 

It is known from the literature that the synthesis of phytase by A. ficuum is subject to a stringent 
ohosphate-dependent regulation (Han and Callagher (1987) J. Indust. MlcVobiol. 1, 295*01). Therefore, the 
demonstration that an isolated gene is subject to a similar regulation can be considered to support the 
evidence that the gene of interest has been cloned. 

In order to Isolate mRNA that has been synthesized under both producing and non-producing 
conditions A. ficuum NRRL 3135 was grown as follows. Spores were first grown overnight m nonnnduong 
medium The nSxTday. the mycelium was harvested, washed with sterile water and inoculated into either 
inducing or non-inducing medium. The medium used contains (per liter): 20 g com starch; 7.5 g glucose; 
0 5 g MgS(V7 H 2 0; 0.2 g FeS(V7 H 2 0; and 72 g KNOs. For the induction of phytase. up to 2 g/l com 
steep liquor was added to the medium, while non-inducing medium contains 2 g-1 K 2 HPO». The mycelium 
was grow for at least a further 100 hours. Samples were taken at selected intervals. Phytase production 
was followed by the phytase assay as described in Example 2A. Denatured mRNA was separated by 
electrophoresis and blotted onto Qenescreen plus. The blots were hybridized with »P-labelled pAF 2-3 or 
wtth the Isolated 3.1 kb Sal I fragment from pAF 1-1 (acid phosphatase) from Example 4. The results are 

shown in Table 2. . A ... 

Positive hybridization of the phytase specific 5.1 kb Bam HI fragment and the acid phosphatase specific 
3 1 kb Sal I fragment with isolated mRNA is observed only when cells are grown under conditions which are 
known tolnduce the synthesis of phytase and acid phosphatases. From these resutts it has been concluded 
that the Isolated genes are regulated as expected for phytase and acid ohosphatases. 



50 



55 
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Tablf 2 

Hybridization of Northern blots using the phytase- 
specific 5.1 kb PflpHI fragment (A) or the acid phosphatase 
specific 3.1kb Sai l fragment (B) as a probe; a + indicates 
the presence of the 1800 b phytase mRNA or the 1800 b acid 
phosphatase mRNA. The relative phytase activity was 
determined for the 24 hr. samples: induced cultures have 10 
times more phytase activity than non-induced cultures. 

Time after Induced Non-induced 

inoculation 



A 24 hours + 

B 24 hours + 



Example 7 



Evidence for the cloning of the phytase gene 

To obtain definitive proof for the successful isolation of the gene encocfing phytase, and to study the 
feasibility of increasing the expression of .the cloned gene, the phytase gene was subcloned into a suitable 
vector and transformed to A. niger 402 (ATCC 9092). To this end, the phytase gene was isolated from the 
lambda clone AF201 as a~0 kb Nru I fragment and cloned into the Stu 1 site of the vector pAN 8-1 
(Mattern, LE. and Punt PJ. (1988) Fungal Genetics Newsletter 35 , 25) which contains the ble gene 
(conferring resistance to phleomycin) as a selection marker. The resulting construct was named pAF 28-1 
(Figure 4) and was transformed to A. niger 402 according to the procedure as described in Example 9, with 
the exception that the protoplasts were plated on Aspergillus minima* medium supplemented with 30 ug 
phleomycia'ml and solidified with 0.75% agar. Single transformants were purffied and isolated and were 
tested for production in shake flasks, as described in Examples 1 and 2. As controls, transformants 
possessing only the vector, as well as the untransformed host were a!so tested (Table 3). Only A. niger 402 
containing pAF 2B-1 appeared to produce a phytase that reacted with a specific monoclonal antibody 
directed against A. ficuum phytase. The phytase reacting with this monoclonal antibody could be eluted 
from an ImmLmoTffinlty'cblumn at pH 2.5 and was shown to be identical in molecular weight, degree of 
glycosylating isoelectric point end specific activity to the A. ficuum phytase. This finding provides clear 
evidence that A. niger 402 cells transformed with pAF 28-1 express a phytase that is virtually identical to 
the A. ficuum phytase!" Similar expression was not observed In either type of control cells. 
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Table 3 



Strain 


Phytese/Activity 
U/ml 


% of phytaso-activity adsorbed 
onto the immunoaffinity column 


A. nlger 402 


0.5 


0 


A. niger 402 pAF 28-1 


0.7 


10 


A. niger 402 pAN 8-1 


0.5 


0 



Strains were grown under induced conditions (Example 6). Samples were taken after 86 hours of growth. 



Example 8 



Characterization of the phytase gene. 

The lambda clones containing the phytase gene have been analyzed by digestion with various 
restriction enzvmes. A map of the genomic region encompassing the phytase gene is given in Figure 4. 
Defined restriction fragments have been subcloned in the cloning vector pUCl9, as indicated in Figure 4. 

It has previously been shown (Example S) that the 5.1 kb Bam HI fragment present in pAF 2-3 
encompasses at least part of the phytase gene. Moreover the oligonucleotide probes 1295 and 1297 (Figure 
28) were shown to hybridize to the Sal I insert from pAF 2-7 (positions of pAF 2 clones are presented In 
Figure 4). while probe 1296 probably spans the Sal I site between the fragments in pAF 2-6 and pAF 2-7. 
The results of these experiments indfcate that the phytase encoding sequence is located In the lefthand part 
of the Bam HI insert of pAF 2-3. 

Subsequently the nucleotide sequences of the inserts of plasmlds pAF 2-3, pAF 2-6. and pAF 2-7 have 
been determined completely using Hie dideoxy chain termination method (Sanger et aJ . (1977) Proc. Natl. 
Acad. Sci. USA 74 , 5483-5467) and shotgun strategies described by Messing et al . (1981. Nucl. Acids 
Res. 9 , 309-32lTln addition specific oligonucleotides were synthesized based on nucleotide sequence 
information obtained during the sequencing procedure. 

The complete nucleotide sequence of clones pAF 2-3. pAF 2-6. and pAF 2-7 encompassing the 
chromosomal phytase gene locus Is compiled in Figure 8. a graphic representation is given in Figure 7. 

Analysis of the protein coding capacity of the complete sequence revealed that the N-terminaf amino 
acid sequence of the mature protein was encoded starting from nucleotide position 381 (the N-terminus 
disclosed by Ullah is located at position 369). Furthermore, the N-terminal amino acid sequence of the 38 
kOa and 2.5 kOa internal peptide fragments (see Figure 1B - sequences B and A) were found to be 
encoded at nucleotide positions 1101 and 1548, respectively. The open reading frame stops at nucleotide 
position 1713. 

These findings clearly prove the identity of the characterized chromosomal locus as containing phytase 
encoding DNA sequence. 

Directly upstream of the chromosomal sequence encoding the mature phytase protein, no atg start 
codon can be found within the reading frame contiguous with the mature protein open reading frame; 
however, using intron-exon boundary characteristics, an Intron can be postulated between nucleotide 
positions 254 and 355, bringing the ATG codon at nucleotide position 210 In frame with the mature phytase 
encoding open reading frame. The derived amino acid sequence of this N-termlnal extension closely fits the 
rules for a secretion signal sequence as published by von Heyne (1983. Eur. J. Biochem. 133 . 17-21). 

To confirm these hypotheses the phytase cDNA was isolated by PCR-amplification with specific 
phytase primers and a total mRNA/cDNA population as template according to the procedures described 
below. 

Isolation of poly A* RNA from Aspergillus flcuum. 

Total RNA was isolated from A. flcuum NRRL 3135 grown under Induced conditions as mentioned in 
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Example 6. Dry mycelium was trczen with liquid nitrogen and ground. Subsequently, the powder was 
homogenized In an Ultra-Tunax (full speed during 1 minute) in 3M LiCI ( 6M urea at O'C and maintained 
overnight at 4*C as described by Auffrey & Rougeon (Eur.J.BIochem., 107 , 303-314,1980). Total cellular 
RNA was obtained after centrifugation at at 16,000 g for 30 minutes and two successive extractions with 
phenol; chloroform: iseamylalcohol (50:48:2). The RNA was precipitated with ethanoi and dissolved in 1 ml 
10 mM Trts-HCt (pH 7.4), 0.5% SDS. For poly A* selection the total RNA sample was heated for 5 minutes 
at 60 * C, adjusted to 0.5 M NaCI and subsequently applied to an oligo(dT)-cellutose column. After several 
washes with a solution containing 10 mM Tris-Hd pH 7.4, 0.5% SDS and 0.1 M NaCI the poly A RNA 
was collected by elution with 10 mM Trls-HCI pH 7.4 and 0.5% SDS. 



Preparation of the mRNA/cDNA complex 

For the synthesis of the first cDNA strand 5 ug of poly A* RNA was dissolved in 16.5 ul H 2 0 and the 
following components were added: 2.5 ul RNasin (30 UM); 10 ul of a buffer containing 50 mM Tris-HCI pH 
7.6, 6 mM WlgCIa and 40 mM KCI; 2 ill 1 M KCI; 5 ul 0.1 M DTT; 0.5 ul oligo (dT); 2 -it (2.5 mg/ml); 5 ul 8 
mM dNTP-mix; 5 ul BSA (1 mg/ml) and 2.5 ul Moloney MLV reverse transcriptase (200 U/mi). The mixture 
was incubated for 30 minutes at 37 ' C and the reaction was stopped by addition of 10 ul 0.2 M EDTA and 
50 Ul H 2 0. An extraction was performed with chloroform and after centrifu gation 110 ul 5 M NH d Ac and 
440 Ul ethanoi were successively added to the supernatant. Precipitation of the mRNA/cDNA complex was 
performed in a dry ice/ethanol solution for 30 minutes. The mRNA/cDNA was collected by centrifugation, 
subsequently washed with 70% ice-cold ethanoi and dissolved in 20 ul H 2 0. 



Cloning of phytase cDNA fragments 

Isolation of the cDNA-encoding phytase sequences were performed by the polymerase chain reaction 
(PGR) in two fragments. Four synthetic oligonucleotide primers were designed based on the genomic 
phytase sequence as presented In Figure 6. # 

Oligo 1: 5 -QGGTAG.AAT.TCA.AAA.ATQ.GGC.GTCTCT.GCT.QTT.CTA-3 

Oligo 2: 5 -AGT.GAC.GAA.TTC.GTG.CTG.GTG.GAG.ATG.GTG.TCG-3' 

Oligo 3: 5-GAaCAC.CAA.GCT.GAA.GGA.TCC-3' 

Oligo 4: 5-AAA.CTaCAG.GCG.TTG.AGT.GTGjMT.GTTTAA.AGG.G-3 

Oligo 1 contains the nucleotide sequence downstream of the phytase ATG startcodon (position 210 to 
231) flanked at the 5' border by an Eco Rl-slte; oligo 2 contains the nucleotide sequence immediately 
upstream of the Sal l-site (position 1129 to 1 109) also flanked by an additional Eco Rl-site; oligo 3 contains 
the nucleotide sequence around the Bam Hl-site (position 845 to 865) and oligo 4 contains a nucleotide 
sequence positioned downstream of the phytase stcpcodon (position 1890 to 1887) flanked by an additional 
Pst l-site. 

~~ The polymerase chain reactions were performed according to the supplier of Taq-polymerase (Cetus). 
As template the solution (1.6 ul) containing the mRNA/cDNA hybrids (described above) was used and as 
primers 0.3 ug of each of the oligos 1 and 2 in the reaction to amplify the N-terminai phytase cDNA part 
and oligos 3 and 4 in the reaction to amplify the C-terminal phytase cDNA part (see Figure 8). After 
denaturation (7 minutes at 100* C) and addition of 2 U Taq-polymerase the reaction mixtures were 
subjected to 25 amplification cycles (each: 2 at 55 'C, 3' at 72 *C, 1' at 94 *C) in a DNA-ampltfier of 
Perkin-Eimer/Cetus. In the last cycle the denaturation step was omitted. After digestion (Eco Rl for the N- 
terminal cDNA part and Bam HI and Pst I for the Oterminal cDNA part), both cDNA fragments were cloned 
into the approplate sites of pTZ18R (Promega). 

The nucleotide sequence of both obtained PGR fragments was determined using the dideoxy chain 
termination technique (Sanger, supra) using synthetic oligonucleotides designed after the chromosomal 
phytase gene sequence, as primers and total amplified DNA as well as cloned cDNA fragments as 
template. The sequence of the cDNA region encoding the phytase protein and the derived amino acid 
sequence of the phytase protein are depicted in Figure 8. 

The cDNA sequence confirmed the location of the Intron postulated above, and indicated that no other 
Introns were present wtthin the chromosomal gene sequence. 

The phytase gene encodes a primary translation product of 467 amino acids (MW 51091); processing 
of the primary translation product by cleaving off the signal peptide results In a mature phytase protein of 
444 (MW 4B851) or 448 (containing the first four N-terminal amino acids as published by Ullah, MW 49232) 
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amino acids. 

Example 9 



Overexpression of phytase in Aspergilll by Introduction of additional phyta» genomjc DNA copies 

Construction expression vector pAF 2-25 

All constructs were made using standard molecular biological procecures, as described by Manlatjs et 
al (1982) Molecular cloning, A laboratory Manual. Cold Spring Harbor Laboratory, N.Y.. 
" "An expression vector pAF 2-2S was made by subcloning the 6 kb PVu II DNA fragment of the phytase 
genomic clone lambda AF201. into the Sma l-slte of P UC19. The derived plasmid was 6**9™** p ^ 
fRoure 4). As selection marker for thelrahsformation to Aspergillus, the Eco Rl/Kpn I DNA fragment <rf 
plasmld pGW325 (Wernars K. (1986). Thesis. Agriculture University. Wageningen, The Netherlands 
containing the homologous Aspergillus nldulans arnd S gene, was inserted into the Eco Rl/Kpn I sites of 
pAF 2-2. The resulting expression vector was designated pAF 2-2S and ss shown m Rgure 9. 

A. Overexpression of phytase In A. ficuum NRRL 31 35. 

The plasmid pAF 2-2S was Introduced In A. ficuum NRRL 3135 using transformation procedures as 
described by Tilbum. J. «UI.(1963) Gene 26 . 205-221 and Kelly. J. & Hynes. M. (1985) EMBO J.. 4 . 475- 
479 with the following modifications: 

- mycelium was grown on Aspergillus minimal medium (Cove. D. (1966} Biochem. Biophys. Acta _ 113 .51- 
56) supplemented with 10 mM arginine and 10 mM proline for 16 hours et 30 C in a rotary shaker at 300 

rpm; , 

- only Novozym 234 (NOVO Industri). and no helfcase. was used for formation of protoplasts; 

- after 90 minutes of protoplast formation. 1 volume of STC buffer (1.2 M sorbitol. 10 mM Tris-HCI pH 7.5, 
50 mM CaCIs) was added to the protoplast suspension and centrifuged el 2500 g at 4 C for 10 minutes in 
a swinging-bucket rotor. The protoplasts were washed and resuspended in STC-buffer at a concentration of 

. 1 pUmWDNA was added in a volume of 10 ul in TE buffer (10 mM Tris-HCI pH 7.5. 0.1 mM EDTA) to 100 
ul of the protoplast suspension; 

- after incubation of the DNA-protoplast suspension at 0 C for 25 minutes. 200 ul of PEG solution was 
added dropwise (25% PEG 4000 (Merck), 10 mM Tris-HCI pH 75. 50 mM CaCl 2 ). Subsequently. 1 ml of 
PEG solution ( 60% PEG 4000 in 10 mM Tris-HCI pH 7.5. 50 mM CaCfe) was added slowly, with repeated 
mixing of the tubes. After incubation at room temperature, the suspensions were diluted with STC-buffer. 
mixed by inversion and centrifuged at 2000 g at 4* C for 10 minutes. The protoplasts were resuspended 
aently In 200 ul STObuffer and plated on Aspergillus minimal medium with 10 mM acetamide as the sole 
nitrogen source, 15 mM CsCI. 1 M sucrose, solidified with 0.75% bacteriological agar #1 (Oxoid). Growth 
was performed at 33' C for 8-10 days. 

Single transformants designated SP4. SP7 and SP8 were isolated, purified end tested for phytase 
production In shake flasks, using the process as described in Examples 1 and 2. As a control, transformants 
possessing only the vector (amdS gene In P UC18). as well as the ^transformed host were tested. 

Strains were grown under induced conditions (see Example 6) and samples were taken after 96 hours 
of growth. Analyses were performed by measuring the phytBse activity (Table 4) and by isoelectric focusing 
polyacrylamide gelelectrophcresis (IEF-PAGE). 

Samples of equal volume were taken from fermentations of A. ficuum and A. ficuum pAF 2-2S 5P7. 
grown under identical conditions, and were applied onto an IEF-PA"GTgeT(pH-range 4.5-6, Phast-System. 
Pharmacia). The electrophoresis was performed according to the instructions of the manufacturer. Subse- 
quently, the gels were either stained with the general protein stain Cowr.assie Briliant Blue (Rgure 10B). or 
with the general phosphatase activity staining described in Example 2 (Figure 10A). 

A sample of A. ficuum phytase. purified to homogeneity (via immunoaffinity chromatography as 
described in Example 7)TwIs also applied either alone, or mixed with a culture supernatant. 



EP 0 420 358 A1 



Phytase is present in the various samples In a number of Isoforms (indicated with an asterisk), as has 
been mentioned in this invention. The two major isoenzymes are clearly visible in the purified phytase In 
lanes 3 and 4 with both staining procedures (A and B). The phytase bands are barely visible In the parent 
A. ficuum strain, and significantly increased In the pAF 2*2S SP7 transformant strain, 

6 

Table 4 



Increase of phytase production by 


transformation of A. ficuum NRRL 3135. 


Strain 


Phytase 




activity (U/ml) 


A, ficuum 


0.6 


A. ficuum + control plasmid 


0.8 


A. ficuum pAF 2-2S SP8 


7.6 


A. ficuum pAF 2-2$ SP7 


6.7 


A. ficuum pAF 2-2S SP4 


4.3 



30 



B. Overexpression of phytase in A.niger CBS 513.88. 

26 The expression vector pAF 2-2S was also introduced in A. niger CBS 513.88 by transformation 
procedures as described for A. ficuum . Single transformants were isolated, purified and tested for phytase 
production in shake flasks under induced growth conditions as described in Example 6. 

Phytase expression levels of some transformants (designated as A. niger pAF 2-2S # 8, # 20 and # 33) 
and control strains were performed as described in Example 9A and are shown in Table 5. 

50 A, niger transformants have phytase expression levels comparable with A. ficuum transformants, in 
additibnlhis result Indicates that the A. ficuum phytase promoter is active in A. niger , 

Further analysis was performed on culture medium of transformant PAF 2-2S #8 by electrophoresis on 
an IEF-PAGE gel In the pH range of 4.5-6 on a Phast-System (Pharmacia) as described above. Equal 
volumes of the culture supernatants of the A. niger parent strain and of the transformant pAF 2-2S #8, 

35 grown under identical conditions, were applledorrto the gel. The gels were run and subsequently stained as 
above. 

The parent A. niger produces a very low amount of phytase, which could not be detected by gel 
electrophoresis. TtieTitraln PAF 2-2S #8 produces approx. 80 times more phytase. and this difference is 
clearly visible In Rgure 11. 

40 Several Isoforms of the phytase enzyme are detected (indicated by asterisk). The general protein stain 
indicates that the Intensity of the phytase protein bands is dramatically increased, while no other major 
protein bands appear. 

Table 5 

4S 



Phytase production by transformation of 


A. niger CBS 513.88 with pAF 2-2S. 


Strain 


Phytase 




activity (U/ro!) 


A. niger 


0.2 


A. niger + control plasmid 


02 


A* nigerpAF2*2S#8 


14 


A. niger pAF 2-2S # 33 


5 


A* niger pAF 2-2S#20 


4 
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Example 10 



Phytase expression in A. niger transformed with expression vectors containing the A. ficuum phytase gene 
fused to the promoteTand/or signal sequences of the A. niger amyloglucosidaselR'G) gene. 



Co nstructions of the expression vectors. 

To obtain overexpression of phytase in A. niger . additional expression cassettes are derived in which 
the A. ficuum phytase gene is under control of the A. niger amyloglucosidase (AG) promoter In combination 
wrthditterenF signal sequences. In pl8FYT3 end p24FYT3 the respective 18 and 24 amino acid <aa) leader 
sequences of the A6 gene from A. niger are fused to the phytase gene fragment encoding the mature 
protein. In the expression cassette pFYT3 the AG promoter sequence is fused to the phytase encoding 
sequence including the phytase leader sequence. 



Construction of p18FYT3 

Fusion of the AG-promoter and the 18 aa AG-leader sequence to the phytase sequence encoding the 
mature protein were performed by the Polymerase Chain Reaction method. In the PCR reactions two 
different templates were used: pAF 2-2S containing the entire phytase gene as described above and pAB6- 
1, a plasmld which contains the entire AG-locus from A. niger , which was isolated from a A. niger plasmid 
library, containing 13-15 kb Hin dill fragments in pUC19. For the isolation, AG-specific oligos were used: 
AG-1 : 5 -GACAATGGCTACACCAGCACCGCAACGGACATTGTTTGGCCC-3' 
AG-2: 5 -AAGCAGCCATTGCCCGAAGCCGAT-3' 

both based on the nucleotide sequence published for A. niger (Boel et aj . (1984), EMBO J. 3 , 1097*1102; 
Boel et al . (1984). Mol. and Cell. Biol. 4 , 2306-2315). TRTbligorujcieotide probes were derived from the 
seque*nc? surrounding Intron 2: oligo AG-1 is located 3 of the intron and has a polarity identical to the AG 
mRNA and oligo AG-2 is found upstream of intron 2 and is chosen antiparallel to the AG mRNA. Ptasmid 
pAB6-1 contains the AG gene on a 14.5 kb Hin dill fragment (see Figure 12). 

As primers for the PCR-amplifications four synthetic oligonucleotides were designed with the following 
sequence: 

Oligo 1: 5 l ^CTGCAGS&&I2CAAGCTAG-3 , (an AG-specific 

sequence around the ££&RI site approx. 250 bp 

upstream the ATG initiation codon) ♦ 
Oligo 18-2 : 5 • -CGAGGCGGGGACTGCCAGTGCCAACCCTGTGCAGAC-3 • 

mature phytase <- L -> 18 AA AG-leader 
Oligo 18-3; 5 1 -GTCTGCACAGGGTTGSCACTGGCAGTCCCCGCCTCG-3 1 

18 aa AG-leader <— mature phytase 
Oligo 4: 5 1 -GGCACGASS&I££TTCAGCTT-3 1 (a phytase specific 

sequence located at the fiafiHI site on position 

861) 

The PCR was performed as described by Sski et al . (1988), Science 239 . 487-491, with minor 
modifications (see Example 8). 

To fuse the AG sequences to the phytase coding sequences two separate PCR's were carried out the 
first reaction with pAB6-1 as template and oligos 1 and 18-2 as primers to amplify a 300 b^DNA fragment 
containing the 3'-part of the AG promoter and the 18 aa AG-leader sequence flanked at the 3 -border by the 
nucleotides of the phytase gene, and the second reaction with pAF 2-2S as template and oligos 18-3 and f 
as primers to amplify a 600 bp DNA fragment contsming the 5 part of the phytase gene flanked at the 5 - 
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border by 18 nucleotides of the AG signal peptide. A schematic view of these amplifications is presented in 
Figure 13. 

The two DNA fragments generated were purified by gelelectroprxrresis and ethanol precipitation and 
used as templates in the third FOR with ollgos 1 and 4 as primers to generate the AG-phytase fusion. The 
obtained DNA fragment was digested with Eco R1 and Bam HI and sjbcioned into pTZ18R. The resulted 
fusion was sequenced and designated p18FYTl. 

The remaining (3.5 kb) upstream region of the AG-promoter was obtained by digestion of pAB8-1 with 
Kpn 1 and partially with Eco R1 and ligated to the 1.1 Kb Eco Rl,Bam HI fragment of pt8FYTl and 
subsequently cloned into the~Kpn 1 /Bam HI sites of pTZ!8R. Plasmid pl8FYT2 thus obtained is shown c 
Figure 15. 

An additional Hin dill restriction site was introduced by insertion of the synthetic fragment: 

5 1 AATTCAAGCTTG 3 f 
3' GTTCGAACTTAA 5 1 

into the Eco Rl-site (flanking the amd S-gene) of pAF 2-2S. The obtained plasmid was designated pAF 2- 
2SH (Figure 14) end is used as starting plasmid to exchange the phytase promoter sequences by the PCR 
AG-phytase fusion DNA fragments. 

For the final construction, pl8FYT2 and pAF 2-2SH were digested with Kpn I and partially with Bam Hi. 
The 4.6 kb DNA fragment of p18FYT2 and the 11 kb DNA fragment of PAF 2-2SH were isolated and 
purified by gei electrophoresis, subsequently ligated and transferred to E. coH . The derived expression 
cassette was designated p18FYT3 (Figure 15). 



Construction of p24FYT3 

Fusion of the AG-promoter and the 24 aa AG leader sequence to the mature phytase encoding 
sequence was performed by PCR-amplification as described above fcr the construction for p18FYT3 with 
the exception of the primers used. Two new primers were synthesized with the following sequence: 

Oligo 24-2: 5 1 -CGAGCCGGGGACTGCCAGGCGCTTGGAAATCACATT-3 1 
taature phytase <-*-> 24 AA AG-leader 

Oligo 24-3: 5 1 -AATGTGATTTCCAAGCGCCTGGCAGTCCCCGCCTCG-3 ■ 
24 aa AG-leader <- L -> nature phytase 

Two separate PCFVs were carried out: the first reaction with pAB 6-1 as template and oiigos 1 and 24-2 as 
primers to amplify a 318 bp DNA fragment containing the 3-part of the AG promoter and the 24 aa AG 
leader sequence flanked at the 3' -border by 18 nucleotides of the phytase gene and the second reacticn 
with pAF 2-2S as template and ollgos 24-3 and 4 as primers to ampiify a DNA fragment containing the 5 - 
part of the phytase gene flanked at the 5-border by 18 nucleotides of the 24 aa AG leader. A schematic 
view of these amplifications is presented in Figure 13. 

For the construction of the final expression cassette p24FYT3 via the intermediate plasmids p24FYTi 
and p24FYT2, the same cloning pathway/procedure was used as described for pl8FYT1 and p18FYT2 to 
derive the expression cassette pl8FYT3 (Figure 15). 

Construction of pFYT3 

Fusion of the AG-promoter to the phytase gene (including the phytase leader) sequence was also 
performed by PCR-amplification as described above for the construcSon of pl8FYT3 with the exception of 
the primers used. Two additional primers were generated with the foijowing sequence: 
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oligo fyt-2: 5 . ^caccacagacg^^ ' 
phytase leader <- J -> AG-promoter 
. oligo fyt -3 : 5 • -CATCATTACACCTCAGCAATGGGCGTCTCTGCTGTT-3 
Oligo tyt AG . proBOte r <-^-> phytase leader 

Two separate PCR's were carried out the first reaction wlth (P AB 6-1 as template and oOgoe 1 and fyt-2 
to as primers to amplify a 282 bp DNA fragment containing the 3 -part of the AG promoter flanked a the 3 - 
bor^b 8 nucleotide* of L phytase leader and the second reaction with pAF 2-2 i as tem^ate . «d 
ofeot M9 and 4 as primers to amplify a DNA-frapment contamng the 5-part of the phytase gene 
SdirS the pr,y.ase Wder) and flanked at the ('-border by 18 nudeotides of the AG-promoter. A 
schematic view of these amplifications is presented in Figure 13. 

schemata v.ewo P ^ ^ ^ intermediate plasrnWs pFYTl and 

pFv£ Je cloning pathway/procedure was used as described for P 18FYT1 and P 18FYT2 to denve 
the expression cassette pi8FYT3 (Figure 15). 

a Exp^e^ojftophyja^g^u^tte 

E coll sequences were removed from the phytase expression cassettes described above by Wn dIH 
cJ»o?/3rdt. the A. niger strain CBS 513.88 (deposited October 10, 1988) was transformed with 10 
? Q OI^'fTLt by proTeduTes as described In Example 8. Single A. niger transformants from each 
* exorwsTon cassette were isolated, and spores were streaked on selects acetamide-agar plates. Spores of 
^ZZTZ co^* from Sis grown for 3 days at 37 ' C on 0.4% ' *f 

England) agar plates. Phytase oroductJon was tested in shake flasks under the following growth conditions. 

AppSaU 1 x 10» spores were inoculated In 100 ml pre-culture medium cotrta,n,ng (per liter): 1 g 
KH 2 P^ 30 g mitose; 5 g yeasH*tract; 10 g casein-hydroiysate; 0.5 g MgS<V7H,0 and 3 g Tween 80. 

30 ™££E^* 34 ' C in a rotary shaker. 1 mi of the growing cut. re was inoculated in a 100 
ml main-ollture containing (per liter): 2 g KH.PO,; 70 g maltodextrin (Maldex MDO, Amylur^ m 0 
^east-exhact; 25 g casein-hydroiysate; 2 g fcSO.; 0.5 g MgSO^TH.O; 0.O3 g ZnCb: 0.02 g CaCfe. 0.05 g 
MnSOiM H 2 0 and FeSO*. The pH was adjusted to 5.6. a^u-h i„ 

*> The mycelium was grown for at least 140 hours. Phytase production was measured as desenbeo .n 
Example I The production results of several, random transformants obtained from each expression 
cassette are shown in Table 6. 
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TablgL_i 



Phytase production of several A. nioer CBS 513.88 strains 
transformed with plasmids containing the A. ficuum phytase 
gene under control of the A. niaer AG-promoter in 
combination with different leader sequences. 



Expression cassette Trans fornant # 



Phytase 
activity (U/ml) 



P18FYT3 
(AG-promoter/ 
18 aa AG-leader) 



pl8FYT3 # 240 
pl8FYT3 * 242 
pl8FYT3 # 243 
P18FYT3 # 244 
pl8FYT3 # 245 
pl8FYT3 # 246 
pl8FYT3 # 250 



82 
84 
62 
43 
80 
82 
110 



P24FYT3 

(AG-proaoter/ 
24 aa AG-leader) 



P24FYT3 #256 
p24FYT3 # 257 
p24FYT3 # 258 
p24FVT3 #259 
p24FYT3 I 260 
p24FYT3 I 261 
p24FYT3 #262 
p24FYT3 #265 



8 
30 
13 
33 
17 
28 
18 
12 



pFYT3 

(AG-promoter/ 
phytase leader) 



pFYT3 #205 
pFYT3 # 282 
pFYT3 # 299 
pFYT3 #302 
pFYT3 #303 
pFYT3 #304 
pFYT3 #305 
pFYT3 #312 



50 
280 

96 
220 
175 
150 
150 
140 



The data clearly show high phytase expression levels in A. niger transformants containing the phytase 
gene under the control of the A. nlger AG promoter. The data also show that the highest phytase production 
is obtained with the pFYT3~expresslon vector, which contains the phytase leBder sequence. Similar 
expression vectors containing an Intronless phytase gene after transformation to A. niger . resulted in 
phytase expression levels comparable to pFYT3 transformants of A. niger . 

In addition, electrophoresis on an IEF-PAGE gel in the pH-range of 4.5-8 was performed on culture 
supematants of transformants pFYT3 #205 and #282. Equal volumes of the culture supematants of the A. 
niger parent strain and of both transformants, grown under identical conditions, were applied onto the gel. 
runend subsequently stained as described in Example 9. The parent A niger produces a very low amount 
of phytase, which is not detected In this experiment. The strains pFff3~#205 and #282 produce approx. 
250 and 1400 times more phytase (compare phytase levels in Tables 4 and 5), and this difference Is clearly 
visible in Figure 11. Several isoforms of the phytase enzyme are detected (indicated by an asterisk), The 
general protein stain indicates that the intensity of the phytase protein bands is dramatically increased, 
while no other major protein bands appear. 



Example 11 
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Overexpression of phytase in A. ficuum and A. niger grown on an industiai scale 



A. A. ficuum 

Strain A. ficuum pAF 2-25 #4 and A. ficuum NRRL 3135 were grtmr. as described in Example 1. The 
transformam pToduced approximately 50 times more phytase as compared to the wild-type strain. 

Table 7 



10 



15 



2C 



Dverexpression of phvtase by a transformant of A. ficuum 


containing multiple phytase genes. Cells were grown as 




described in Example l. 


Hours after 


Phytase activity A. 


(U/ml Fermentation broth) 


inoculation 


ficuum NRRL 31 35 


A. ficuum pAF 2-2S #4 


0 


0 


C 


24 


0 


0 


92 


2 


142 


141 


5 


270 



26 



6. A. niger 



Strain A niger pAF 2-2S #8. a transformant of A. niger strain CBS 513.88 and the parent A. niger strain 
itself were~grow7Tas described in Example 1. The transformant produced approximately 1000 times more 
30 phytase as compared to the original A. niger parent strain (Table 8). 

Table 8 



35 



40 



Overexpression of phytase by a transformant otf A. niger 


(CBS 513.88) containing multiple phytase genes. Cells 


were grown as described In Example l. 


Hours after 


Phytase activity A. 


(U/ml fermentation broth 


inoculation 


niger CBS 513.88 


A. niger pAF 22 #8 


0 


0 


0 


24 


0 


5 


92 


0.1 


65 


141 


0.1 


95 



45 



50 



Example 12 



To contruct'the vector pREPFYT3, with which simultaneously phytase expression and AG gene 
replacement Is achieved. pFYT3 is digested with Kpn I. With the obt&ned linear Kpn I DNA fragment two 
separate ligations are performed. 
55 Ligation 1 with the Kpn l-HIn dill adaptor. 
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5 1 - CGGGGA -3 1 
3 1 -CATGGCCCCTTCGA-5 1 
Kdti I fiindlll 

Ugation 2 with the Kpn HHin dill* adaptor, in which the Hin dill restriction site will not restore after 
ligation: 

5 1 - CG6GGG -3 1 
3 1 -CTAGG CCCC CTCGA -5 1 
Kpn l Kindlll* 

Subsequently, ligation 1 is partially digested with Hin dill. After removal of the amd S containing 
fragment by gel electrophoresis, the remaining DNA fragment is recirculated by ligation and transferred to 
E coli . The obtained piasmiti is denoted pFYT3AemdS (see Figure 16). 

Ugation 2 is also digested with Hin dill and the 4 kb DNA Hin dlll/Hn dlir fragment containing the amd 

S gene, is isolated by gel electrophoresis, subsequently Bgated to a partially Hin dill digestof 
pFYT3AAamdS and transferred to EooB . The plasmid containing theatridS gene at the 3 end of the 
phytase gene is derated pFYT3INT (see Figure 17). 

To introduce the approx. 6 kb Sal l/HIn dill DNA fragment of pAB6-1, containing the 3 -flanking AG 
sequence, pFYT3INT is partially digested wth Hin dill, ligated first to the adaptor: 

5 1 -AGCTAGGGGG -3' 
3 ' TC CC CCAGCT -5 1 

Eindiii* Sail 

(in which the Hin dlir restriction site will not restore after ligation) and subsequently with the Sal l/Hn dill 
fragment of pAB6-1. After transformation to E. coli , the desired plasmid pREPFYT3, containing the 3 AG 
flanking sequence at the correct position, is obtained (Figure 18). 

Expression of phytase in A. niger by AG gene replacement 

Before transformation of A. niger with pREPFYT3, the E co« sequences in the plasmid are removed by 
Hin dill digestion and getelecfrophoresis. The A niger strain CBS 513.88 is transformed with 10 ug DNA 
fragment by procedures as described in Example 9. Selection and growth of transformants is performed as 
described in Example 9. Only a minority of the selected transformants lose AG activity (approx. 20%), 
Southern analysis of chromosal DNA is performed on AG negative and phytase positive transformants to 
verify that the AG gene is indeed replaced by the phytase gene. 



Example 13 



Conservation of the phytase gene in different species. 



To determine whether the phytase gene is highly conserved within microbial species. Southern 
ana ; yses of chromosomal DNA from ten different species were performed with the A. ficuum phytase cDNA 
as probe. 

These chromosomal DNA analyses were performed on species from filamentous fungi, yeasts and 
bacteria. As an example, only a Bmited number from each group were chosen: for filamentous fungi. 
Penicillium chrysogenum and Aspergillus niger ; for yeast. Saccharomyces cereylsiae and Kluyveromyces 
lactis ; and for the procaryotic organisms the Gram-positive species, Bacillus subtilis . Clostridium 
thermoceltum , and Streptonyces llvldans and as an example for a gram-negative bacterium Pseudomonas 
aeruginosa . 

— High molecular weight chromosomal DNA from these species was digested with Pyu II and Bam HI 
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separately and subsequently etectrophorized on a 0.7% agarose gel. 

After transfer to nitrocellulose filters, the hybridization was performed overnight at low stringency (6 x 
SSC; 60 ' C) with a ^P-labeled s'-phytase cDNA fragment (described in Example 8). Blots were washed in 6 
x SSC at room temperature and exposed to X-ray for 18 hours. 

As shown In Figures 19 a and b, descrete bands are observed in almost every lane, predicting a high 
degree of homology of the phytase gene between microbial species. 

Claims 

1. A purified and Isolated DNA sequence characterized In that said DNA sequence encodes a peptide or 
protein having phytase activity. 

2. A purified and isolated DNA sequence according to Claim 1, further characterized in that said sequence 
is derived from a microbial source. 

3. A purified and isolated DNA sequence according to Claim 1, further characterized in that said sequence 
is derived from a fungal source. 

4. A purified and isolated DNA sequence according to Claim 1. further characterized in that said sequence 
is derived from an Aspergillus source. 

5. A purified and isolated DNA sequence according to Claim 1 , further characterized in that said sequence 
is derived from an Aspergillus ficuum or an Aspergillus niger source. 

6. A purilied and isolated DNA sequence according to Claim 1, further characterized in that said sequence 
encodes a phytase which exhibits the following charactistics: 

a) provides a single band at 85 kDa on SDS-PAGE when expressed in an Aspergillus host; 

b) has an apparent molecular weight after deglycosylation in the range of about 48 • 56.5 kDa; 

c) has a specific activity of about 100 U/mg protein. 

7. A purified and isolated DNA sequence characterized in that said sequence exhibits at least one of the 
following characteristics: 

a) hybridizes to an oligonucleotide probe derived from the DNA sequence as disclosed in Figure 6; 

b) hybridizes to an oligonucleotide probe derived from the cDNA sequence as disclosed in Figure 8. 

8. An expression construct characterized in that a DNA sequence according any one of claims 1-7 is 
operably linked to a regulatory region capable of directing the expression of a protein or peptide having 
phytase activity In a suitable expression host. 

9. The expression construct of Claim 8 further characterized in thai the regulatory region also contains a 
secretory leader sequence providing for the secretion of the expressed protein or peptide having phytase 
activity. 

10. The expression construct of Claim 9 characterized in that the AG promoter is used to direct the 
expression of the protein or peptide having phytase activity. 

tl. The expression construct of Claim 10 further characterized in that a homologous phytase leader 
sequence Is used to provide for the secretion of the expressed protein or peptide having phytase activity, 

12. The expression construct of Claim 10 further characterized in that the 18 amino acid AG leader 
sequence is used to provide for the secretion of the expressed prctein or peptide having phytase activity. 

13. The expression construct of Claim 10 further characterized in that the 24 amino acid AG leader 
sequence is used to provide for the secretion of the expressed protein or peptide having phytase activity. 

14. The expression construct of Claim 9 characterized in that a homologous phytase promoter is used to 
direct the expression of the protein or peptide having phytase activity. 

15. The expression construct of Claim 14 further characterized in that a homologous phytase leader 
sequence is used to provide for the secretion of the expressed protein or peptide having phytase activity. 
18. The expression construct of Claim 14 further characterized in that the 18 ammo acid AG leader 
sequence is used to provide for the secretion of the expressed protein or peptide having phytase activity. 

17. The expression construct of Claim 14 further characterized in that the 24 amino acid AG leader 
sequence Is used to provide for the secretion of the expressed protein or peptide having phytase activity. 

18. A vector capable of transforming a host cett characterized in that said vector contains an expression 
construct according to any one of Claims 8 to 17. 

19. A vector according to Claim 18. further characterized in that said vector is a plasmid. 

20. A vector according to Clam 18, further characterized in that said vector Is a plasmid selected from the 
group consisting of pAF 28*1 . pAF 2-2S, pAF 2-2, pAF 2-3. pAF 2-4. pAF 2-6, pAF 2-7. p18FYT3, p24FYT3 
and pFYT3. 

21 . A transformed host cell characterized in that said host cell is transformed with a vector according to any 
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one of Claims 18 to 20. 

22. A transformed host cell according to Cterm 21, which is selected from the group consisting of bacteria, 
yeasts and fungi. 

23. A transformed host cell according to Claim 22, which is selected from the group consisting of 
5 Aspergillus , Trichoderma . Penicillium t Mucor , Bacillus , Kluyveromyces and Saccharomyces . 

24. A transformed host cell accordinglolSlaim 22, which is selected from the group consisting of 
Aspergillus niger , Aspergillus ficuum , Aspergillus awamo ri , Aspergillus oryzae . Trichoderma reesel , 
Mucor miehiTT KIuyveromyces TacHT , Saccharomyces cerevisiae , Bacillus subtilfs and Bacillus lichentfor* 

70 257 A process for the production of a peptide or protein having phytase activity, characterized in that a 
transformed host eel! according to any one of Claims 21 to 24 is cultured under conditions conducive to the 
production of said peptide or protein having phytase activity. 

26. A peptide or protein having phytase activity, characterized in that said peptide or protein having phytase 
activity is produced by a process according to Claim 25. 
is 27. A phytase characterized in that the phytase exhibits the following charactistics: 

a) provides a single band at 85 kDa on SDS-PAQE when expressed in an Aspergillus host; 

b) has an apparent molecular weight after degiycosylation in the range of about 48 - 56.5 kDa; 

c) has a specific activity of about 100 U/mg protein. 

28. A feed for animals characterized in that said feed contains a peptide or protein having phytase activity 
20 according to either of Claims 26 and 27. 

29. Use of a peptide or protein having phytase activity according to either of Claims 26 and 27 for the 
conversion of phytate to inositol and inorganic phosphate. 

30. A process for promoting the growth of animals characterized in that an animal Is fed a diet which is 
comprised of a feedstuff supplemented with a phytase according to either of claims 26 and 27, 

25 31 . A process for the reduction of levels of phytate in animal manure characterized in that an animal is fed a 
diet which is comprised of a feedstuff supplemented with a phytase according to either of claims 26 and 27 
in an amount effective in converting phytate contained in the feedstuff to inositol and inorganic phosphate. 
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N-TERMINAL AMINOACID SEQUENCES 

ABC 



POSITION 




01 




02 




03 




04 




06 




06 




07 




08 


mm mi 


09 


GLN 


10 


SER 


11 


SER 


12 


- • m " 


13 


ASP 


14 


THR 


16 


VAL 


16 


ASP 


17 


QLN 


18 




19 




20 




21 




22 




23 


<* 


24 




26 




26 




27 




28 




29 




30 




31 




32 




33 




34 




36 




36 




37 




38 




39 





LEU 
ALA 

VAL 
PRO 

ALA ALA 
SER SER 
ARG 
ASN 

GLN GLN 
SER SER 
SER SER 
GLY 

ASP ASP 

THR THR 

VAL VAL 

ASP ASP 

GLN 

GLY 

TYR 

GLN 

ARG 

PHE 

SER 

GLU 

THR 

SER 

HIS 

LEU 

ARG 
(GLY)*" 

GLN 

TYR 

ALA 

PRO 

PHE 

PHE 
(ASP) 

LEU 

ALA 



Fig. 1A 
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PEPTIDE AMINOACID SEQUENCES 
ABODE 



POSITION 










01 


GLN 


(TRP)* 
SEB* 


MET 


ALA 


VAL 


02 




ME?T 


SER 


VAL 


03 


GLN 


PHE 


GLN 


SER 


ASP 


04 


ALA 


ASP 


CYS 


ALA 





05 


GLU 


THR 


GLN 


GLU 


ARG 


06 


GLN 


ILE 


ALA 


LYS 


PHE 


07 


GLU 


SER 


GLU 


GLY 


PRO 


08 


PRO 


THR ' 


GLN 


TYR 


TYR 


09 


LEU 


SER 


GLU 


ASP 


THR 


10 


VAL 


THR 


PRO 


LEU 


GLY 


11 


(ARG) 


VAL 


LEU 


VAL 


- _ - 


12 


VAL 


ASP 


VAL 


VAL 


ALA 


13 


LEU 


THR 


ARG 






14 


VAL 


LYS 


VAL 






16 


ASN 


LEU 


LEU 






16 


(ASP) 


SER 


VAL 






17 


(ARG) 


PRO 


ASN 






18 


(VAL) . 


PHE 


ASP 






19 


VAL 


(CYS) 


ARG 






20 


PRO 


(ASP) 








21 




LEU 








22 




PHE 








23 




THR 









fiG. 1B 
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N-TERMINUS 100KD PROTEIN 



POSITION 


* 


01 


VAL 


02 


VAL 


03 


ASP 


04 


GLU 


05 


ARG 


06 


PHE 


07 • 


PRO 


08 


TYR 


09 


THR 


10 


GLY 
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BamHI 



SallBamHI 
Hindi III Hindlll 




pAF 1-1 

13803 bps 



Hindlll 



Hindi II 



Sail 




Hind I II 



Sail 



Figure 5 
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Figure 6 

1 GTOTACmCXGTCCTATTCGGCCTCCTCCGCTGAAG^ 
Sail 

61 GCCACCTrTQTGAGCTTCTAACCTGAACTGGTAQAGTATCACACACCATOCCAACGTGQQ 

121 ATGAAQG<K!TTATATGACWCCJpCCGGTCCQGCGCGATCGCCGTAOCTGCCACTCGC^ 

*, 

181 TCTOCAAGAMmCTrCTCATAOOCATCATCOQC^^ 

translation start 

241 ATCTCCnnCTG CCTA^ 

intron start 

301 AGOGCCCCTGMGCTCGGACTtTKTI^^ 

• intron end 

361 CCTCCGGACTGGCAGTCCCCGCCTCGAGAAATCMTC^ 

421 GOTATCAATGCTTCrCCGAQACTTCOCATCTTTGOGGTC 

431 TCWCAMCQMTCGGTCATCTCCCCTGAGGTGCCCGC 

541 ACKnaTTCTCCCaTCATGOAGCGCa^ 

601 CTCTCATTQAGOAQATCCAGCACLWCaCC^CCACCTTTGACGGAA 

661 AOACATACAACTACAGCTTGGm^AQATGACCIX^CT^ 

721 TCAACTCCQGCATCAAGTrcTACCAGCGGTACQAAICaCTCACAAQQAACATCQTrCCAT 
781 TCATCCOATCCTCTGGCTC£AGCC«OT^ 

84l TCCAQAQCAXMOCTOA A0OATCCT CGTQCCCAG<XC0GCCMTCGTCGCCCAAGATCG 

BamHI 

901 ACGTCGTCATTTCCGMGCCAGCTCATCCAACAACACTCTCCACCCAGGCACC^ 
961 TCTrCCMGACAGCOMTTGGCCGATACCGTCGAAQCCAATTTCACCCCCACGTrCGTCC 
1021 CClCC^TrCGTCAACGTCTGOAGAACGACCTGTCCGGTGTGACTCTCACAGACACA 
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Figure 6 (continued 1) 
1081 TGACCTACCTCATGGACATOTGCTCC^ 

Sail 

11*1 AGCTCrTCCCCCTTCTGTQACCTCnTCACCCATQACGAATGGATCAACTACCACrACCTCC 

1201 AGTCCTTQAAAAAaTATTACGQCCATCGTQCAGGTAACCCGCTCGGCCCGACCCAGGGCa 

1261 7CX}GGIACGCTAACGAGCTCA7CGCC<XITCT0ACCCACTCGCCTGTCCACG^ 

1321 GTICCAACCACACTITGGlACTCGAGCCCGGCTACCrmCCGCTC 

1381 CGGACTTTIXXiCATGACAACGGCATCATCTCCATTCTCTT^ 

lWl GCACTMGCCGCTATCTACCACGACCGTGGAGMT^^ 

1501 CTG^TGQACGGTTCCGriTGCTrCGCGTrr^ 

1561 AGCAG<wcK;cG<rrGGix:cGTcnxnTGGrr 

1621 CGgITGATO Cl - llXjG <mQATOTACCCO^ 

1681 GATCTGGGGGTGftTIt3GGCGaAflI<OT 

translation stop 

1741 TATCACATTGCATATCATTAGCACTTCAGOTATCTATM 
1801 ATCAATGQTX&CTGTCACKS^ 

1861 CATCACCCTITAAACA^TCACACTCAACGCACAGCCTACAMCCIAACAAACGCACAAAQA 
1921 ATATTTrACACTCCTCCCCAACGCAATACCAACCGCAATTCATCATACCTCATATAAATA 
1981 CMTACAATACAATAC8ICCATCCCTACCCTCMGTCCACWATCCTATAATCMTC 
2CA1 ACmCTTACTICTWCCCTCCCCCTCACCOTCCCAGAACTCACCCCCGAA 
2101 ACTAGTAGTAGAAGMGCAGACGACXTCTCCAC^MTCTCTTCGGCCTCTTATC^ 
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Figure 6 (continued 2) 

2161 CGCTACACAAAACCCCCACCCCGTTAGCATQCACTCACAAAATAATCAAAAATAACTAAG 
2221 AAGGAAAAAAAAGAAOAAGAMQQTTACATACTCnCTCATACAMCTCCAACSACOTATA 

2281 CATCWGATCGGCAATCCCAC^Amcn'GATATCCATCTATGMCCCATTCCCATCCCAC 

t 

2341 GTTAfflTGAnACTTTACTTAGAAGMGAAAMGGG^ 

2401 GATTGAGITAffrGCTCACC^^ 

22,61 TTCACWCICCTOm»^^ 

2521 TOATCACTCTTGOTTTO^ 

2581 TCrmcnaCTTGGOam^ 

2611 TG1MTGGQT0ATTATTQCK3MTGAAGTAGATITCGCT 

2701 QAATGGATGGATOMTACIATGGAGGCGOAAAAQ^ 

2761 TCTncrOCCIOAOOCATCACTCTCCATCT A ^^ 

2821 GCTMGfirQACTGATTCTACCACAGTGKACAAT^ 

2881 TrrAQATTAACCCQCTCffGXJXATITGCCQTAOC^ 

2 9 U1 ATTAAAACTCATCCTACAGTACAGAATAGMGTAAAAGGAGAAGAQAAAAACAAGATAAT 

3001 ACAACCAGTCCAGGTCCAITCTAQATCTCGAATGACCACCAAATAAGAAAGCAACAAOCA 

3061 AGTAAGCAAAGCATfiSOrCTAAATGAACGCCAATAACTTW 

3121 MCGCTATGCACGAATGGCTCGAMTGATTCCCTTAACTCCGTAGTATTGAGAff 

3181 GAAMGAAAAAMGAGACAGyUUWGCTGACCATGGGAAAGMGCATGATCAGTOGGGAAT 
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Figure 6 (continued 3) 

3241 GCATCTGCGGGTTOAGATAGATATGAGTTGCCTCGCAQATCCGGTGACAAGATAAGAGAA 
3301 TrCGGAGATGTGATCAGOCACTGrAACTTCATCAAQCATCGACATTCMCCQTCGOGTCT 
3361 GCCXJGITGAGATC£AA<3TTGAGAT^ 
3421 TTTTGMCCACTCTIXjACTTCATCAAGCATCAA 

3481 TGTGAGG<nTGATATGCCAGGTICGATGCCACGCAGACCCGMCCGACTGAGAAATATGA 

3541 AAAGTTGGACAGCCACTTCATCTICATCAAGCOTAAAACCCCA^^ 

36OI MCGAATCTGCGGGCroATGTGGAAATGAGACGAATGCCTCGCAGATTCGAAGAW 

3661 MTCGAClATGAACAATCACmAACTrCATCAMGCCTTAMTC^ 

3721 ATTCWGGTCTGCGGGTrGAGGTTCCTGTTGAGATGCCACGCAGAC^^ 

3781 CATTATAAGTTGGWCaAGTGTAQA(7n^CCATTGATAACCGAGATAMCAATC 

3841 CTrCATCAAAGCCTTAMTCACTCAATGGCCAC/lVlUlTWCGGTCT^ 

3901 CAAGTTGCGATGCCACQCAGACTGCAAACATTGA^ 

3961 TTMCrTCMCAAAAGCCTTTCAATCAGTCAATGG^ 

4021 GATATGCGAGTIXl^GGTGCCTCGCAQACCGCGMCATGCG^ 

4081 GAGTGCCTGGCCATTGAGtfMCGAGAGAMCAACCACTTTAAC^ 

4141 CTACTCAATGACCCGlTTGTrGGWSGTCrrOT 

4201 AGACCGCXMCATGCQATOTATCATCTAAGTTAGATGAGT 

4261 GAGAAACAACCACACTTCATGAGAGCCTTAMTTATTCMTGACCAGT^ 
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Figure 6 (continued 4) 

4321 TGCGGCTTGcrrATCcaAarrcmcxmKCTC 

4381 GACGAffltSAAGCCTGACOATCGAGAACT 

HHM TGQCITrAGrATrAGQATCGT^AGGTrKn^COATGQAACOTTCCCTTTGCGTQCG^^ 

9 

4501 CGCGACGAGCCCTt^CTCQGCQTQATICIGAAATTCTCCMTCAGGGCAGCCQC 

4561 OGCGACCXiaACGTCCTCEAGaAGCTCT^ 

1)621 AGTTACATTAAMGCCTCATAGATGffCTTTGGGTGOflTM 

4681 TixriXBAcriTGriWGTcrim 

l»74l GATTITGGMGaGCTGQATCGCTCCrTGCn^ACTCTr^ 

4801 AGTCTOATTTCGCICGGGCTQATCT^ 

4861 ACCCGCTCCTCCCTTCAAGGTCAGCTIXIATGCGCA^ 

4921 AGTTGCTTCCGGCTCrraaCTC^^ 

4981 TTCOaaCOOCGOCariTOCa^^ 

5041 TGGACCWIcmCCmJWJGTrTCATCGTGCT^ 

5101 caGCATCATTCOATCAACGCMAATrCCTCTCOTGCGCX^ 

5161 CGCCTTGACCTCGTCCACGTITrCGAAGAAGUU^GGCATCTTGT^ 

5221 GCGCTCTCCCATGCCSWGGATATCCGAAGATGCGGTCCTrcr^^ 

5281 TTCAGACGAATIGGAGGCTGGGGGAGCAATTTGTCTCCOTAGGrrGTTGTIAGGGCGGAAC 

5341 CAAGMTAGCCTTCGCCTACAACGACAAGCTCTTCGCCAAA^ 
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Figur* 6 (continued 5) 

5401 AAAACQAACCCATCCTCGTCAOTCCACCOOTOCOrCTCGQACOT 
5461 ATTCCCTCAACOCCQATCTCTQCCT 0O GGCTGCOCITCOCA 
5521 cCOCCWMttlOCACCaST^^ 

9 

5581 TCXJGffAACTCTICQATCAQCCTOQQCITCCOATOACT^ 

5641 GCCaJGOCCACXnCAACKJAGCSCGCTITGCTAAMCTGCCCA 

5701 TCCGACCmCTGCAAAACCAGCriXjTITrc 

5761 TTOACTGCTXCITGaXrmACTrC^^ 

5821 CaarrrCTGa3G^GCGCCGCTCAQATItX:AMQATTCC 

588I CTGCTTCnTCAGMGTOCOC^ 

59U1 CCTGGGCCCTCATIXXJAACCATCAAGACCA^ 

* 

6001 TCAAAAGGAAAACITfCOXXnM^ 

6061 TATCCACAWTQaiOTCClAAlTCAGOCTO 

6121 TCAGCAACCACOCGOCCmQCWCOICITCCOaAAAQAAOG^^ 

6181 TCCACGGCCT©3CTTGCGaCGTTGATTGCAQACT 

6241 CTGGTICGAGTAGCAACCTCnXSAATrGG^ 

6301 ACGGAGTAGACTGC^IGATITGGM^ 

6361 CGCGACCAGATCTCGCAGTCGT^ACGAGOA 

6421 MQCTTGOfrGCrAGCCTCCTGOQCTrCACKAQCTO 
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Figure 6 (continued 6) 

6481 AmGCOQAOTITrATCGGCm 
65^1 GTAGACTrcCACCGAAGMGTCTT^ 
6601 GACTTACGCGTCTTGCGGCTG^ 
6661 GMCTAGACaTGCTOKWrnXSrra 



6721 TTGAGTGTICTrQACAGAAAGTrATGCAA CCTCGAC 

' Sail 
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Figure 8 



-23 HOVSAVLLPLYLLSOVTSOL 



61 QCAQrrCCCCOCCTCQAflAAATCAATCCAQTrOCQATACOGTCGATCAQQCOT 

AVPA8 RNQSS'CDTVDQOYQC 
-1*1 

121 TTCTCCQAQACTrCGCATCTTTQGGG^AA 

18FSETSHLWOQYAPPFSLANE 



181 TCGGTCATCTCCCCTGAOGTOCCCOCCGGATQCAOAGTCACT^ 
38 SVISPEVPAGCRVT F AQVLS 

24l CQICATIMAGXXKJGGTATCCG&CCQACfrW^ 
58RHQAHYPTDSKaKKYSALIB 



301 QAQATCCAQCAGAACOC^CCACCTTTGACGQAAMTATOCCTTCCTQAA^CATACAAC 
78BIQQN.ATTPDOKYAFLKTYN 



361 TACA<XnT0O7KX^QATOACCTGACTCCC^^ 
98Y8LGADDLTPFOBQELVNSa 



421 ATCAAOTTCTACCAOOGXJTAC(UIATC<XJTCACAAQ 

118 I KFY QRYB8LT R M X V P FX R 8 



481 TCTQQCTCCAQCCOCXnXlATCXXXntXQW 

138SOSSRVIASOKKFIBGFQST 



Jijl AAGCrQAAGGATCCrCOTOCCCAOCCCOC^^ 

158KLKDPRAQPOQSSPKIDVVI 



601 TCCGAOGCCAGCTCATC«AACAACACTCTCmCCCAOC« 

178 SBAS8SNHTLDPQICTVFED 
661 AGCC^TrGGCCCATACCQTCOAAGCCAATrrCACCCW 

198 8 E 1 A D» VBANFTATFVP8XR 



721 CAACintnSOUUJAACCfcCCTGTCCGC^^ 

218 QRLBND L8QVTLTDTEVTYL 



781 ATOOACATaTGCTCCTTCCWCACCATCTW 

238 MDMCSFDTXSTSTVDTKLSP 
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Figure 8 (continued 1) 

961 AACCaGCTCA^ 

1021 ACTTTGGACTCOAGCCCGt^ 

1081 CATQACAAC(XH2ATCATCTCGATTCTCTT^ 
538HDH 0XX8 ' XLF 

U41 CTATCTACCACWCCGIGGAGAATATCAOT 
358 LSTTIVBNITQTD0FS5AWT 

1201 GTTCCOITlXJCTTCOCGTnXjTAOTTC 
378 VPPASBLYVEMMQCQAB««s' 

1321 TTGGGGAGATGTACCCGGGATAGCTTO 

1381 GATTQQQCGGAGTGTTTTCCTTAQ 
438DWAECFA» 
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Plant Gene Register j 

Starch Branching Enzyme II from Maize Endosperm 

Dane K. Fisher, Charles D. Boyer*, and L. Curtis Hannah 

Department of Horticulture, The Pennsylvania State University, University Park, Pennsylvania 16802 
(D.F., C.D.B.); and Department of Vegetable Crops, The University of Florida, 
Gainesville, Florida 3261 1 (LCH.) 



ADP-Glc pyrophosphorylase (EC 2.7.7.27), starch syn- 
thases (EC 2.4.1.21), and SBEs (EC 2.4.1.18) are the key 
enzymes in the pathway of plant starch biosynthesis. ADP- 
Glc pyrophosphorylase and starch synthase catalyze the for- 
mation of ADP-Glc, the substrate for starch synthesis, and 
the polymerization of Glc into the amylose and amylopectin 
fractions of the starch granule, respectively. SBEs catalyze 
branch point formation by the cleavage and reattachment of 
a-l,4-Iinked Glc chains to a-1,6 branch points in the growing 
starch molecule (Borovsky et al., 1979; Boyer, 1985). Branch- 
ing enzymes are proposed to interact with starch synthases 
in formation of amylopectin (Boyer and Preiss, 1981). 

Multiple forms of SBE have been identified in maize (Zea 
mays L.) endosperm (Boyer and Preiss, 1978, 1981). Three 
forms of SBE, I, Ha, and lib, from developing maize endo- 
sperm have been characterized by their level of branching 
activity on amylose and amylopectin and by kinetic and 
immunological properties (Boyer and Preiss, 1978; Fisher and 
Boyer, 1983; Singh and Preiss, 1985). Distinct differences 
have been shown between SBE I and Ha or lib by the above 
parameters, but only small differences exist between SBE Ha 
and lib. Genetic evidence suggests that Ha and lib are prod- 
ucts of separate genes (Boyer and Preiss, 1981; Hedman and 
Boyer, 1982). The precise role of each isoform in starch 
formation has yet to be determined. To further understand 
these enzymes, efforts to clone the genes have been under- 
taken. We report here the cloning of a SBE II cDNA from 
maize (Table I). Three XgtlO cDNA libraries were constructed 
from endosperm poly(A) + RNA 14, 22, and 29 DAP. A 
heterologous nucleic acid probe, clone pJSBJES, the cDNA for 
pea SBE I, was used to screen the 14-DAP library (Bhatta- 
charyya et al., 1990). After purifying and subcloning into 
plasmid pBluescript II SK" (Stratagene), a full-length cDNA 
of 2725 bp was isolated. 

Northern blots of total maize RNA isolated from endo- 
sperm tissue 12 DAP and probed with the cloned maize 
cDNA revealed a single transcript of approximately 2.7 kb. 
Deduced amino acid sequence was compared with the pea 
SBE I (Bhattacharyya et al., 1990), maize SBE I (Baba et al., 
1991), and rice SBE I (Nakamura et al, 1992) translated 
cDNA sequences using Intelligenetics software. Levels of 
residue identity were 71, 52, and 52%, respectively. From 
these results, we conclude that we have cloned a second 
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Table I. Characteristics of SBE II cDNA from maize endosperm 

Organism: 

Zea mays L (W64A X 182E). 
Gene Product: 

SBE (1,4-a-D-glucan 6-a-D-(1,4-a-D-glucanotransferase); 
EC 2.4.1.18); starch biosynthesis. 
Clone Type; Designation: 

cDNA, full-length; A29-III-1; pMA11 (pBluescript). 
Source: 

cDNA libraries in Agt10 constructed from maize endosperm 
poly(A) + mRNA isolated 14, 22, and 29 DAP. 
Techniques: 

Libraries screened with pea {Pisum sativum L). SBE I clone 
pJSBES (Bhattacharyya et al., 1990); three overlapping clones 
subcloned into pBluescript II SK"; double-stranded dideoxy- 
nucleotide sequencing of overlapping clones using various 
subclones and synthetic oligonucleotide primers; second 
screening with 5' and 3' ends of partial clones to obtain full- 
length clone; both strands of the full-length clone were se- 
quenced (X29-III-1; pMA11). 
Method of Identification: 
Sequence homology to other SBE clones; deduced amino acid 
(residues 58-65) identity to purified maize SBE Mb mature 
protein N-terminal sequence. 
Structural Features of Protein: 
Open reading frame of 798 amino acids; calculated M r of ma- 
ture protein of 84,772; putative 53-amino acid transit peptide 
N terminal to mature protein sequence. 
Subcellular location: 
Amyloplast. 



isoform of SBE from maize endosperm. This conclusion is 
supported by the N-terminal sequence of purified maize SBE 
lib protein, which matches the cDNA predicted amino acid 
sequence at residues 58 to 65. The additional amino acid 
residues making up the N-terminal end of the deduced 
sequence are thought to encode a transit peptide (53 amino 
acids) for routing of the protein to the amyloplast. The 
deduced molecular mass of the mature protein from this 
sequence data is 84,772 D. This is slightly larger than size 
estimates of 80,000 D based upon SDS-PAGE analysis of 



Abbreviations: DAP, days after pollination; SBE, starch branching 
enzyme. 
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purified SBE Ha and lib protein (Boyer and Preiss, 1978; 
Singh and Preiss, 1985). 
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Structure of the sucrose synthase gene on chromosome 9 of 
Zea mays L. 



Claim 67 

Sucrose synthase 




W.Werr, W.-B.Frommer, C.Maas and P.Starlinger 

Institut fur Genetik der Universitat Koln, 5000 Koln 41, FRG 
Communicated by P.Starlinger 

The structure of the shrunken gene of Zea mays encoding 
sucrose synthase (EC 2.4.1.13) was determined by (i) se- 
quencing the transcription unit and ~~ 1.2 kb of 5'-upstream 
sequences from a genomic clone, (ii) by sequencing a nearly 
full length cDNA clone and (iii) by determining the transcrip- 
tion start site by a combination of primer extension exper- 
iments with synthetic oligodeoxynucleotide primers and SI 
mapping. The sucrose synthase gene is 5.4 kb long, of which 
2746 bp are found in the mature mRNA. The gene is inter- 
rupted by 15 introns. The first two introns are ~ 1 kb and 
-0.5 kb in length, respectively, while the other introns are 
much smaller. A TATA box is located 30 bp upstream from 
the transcription start site. Approximately 610 bp upstream 
of the transcription start site a direct repeat of 16 nucleotides, 
separated by a 4-fold repetition of the sequence GGTGG is 
detected. The 16-bp sequence has similarities to a sequence 
repeat found between two promoters of a maize zein gene 
also expressed in the endosperm tissue. The transposable 
element Ds in the mutant sh-m5933 and sh-m6233 alleles is 
inserted in the seventh and first intron, respectively. The 
genomic and cDNA clones were obtained from different maize 
lines. This allows the determination of polymorphic sites 
which are frequent in 3rd codon position and absent in 1st 
and 2nd codon positions. In addition, the 3 '-untranslated 
sequence shows two duplications that may have arisen by the 
insertion and subsequent excision of transposable elements. 
Key words: sucrose synthase/transcnption signals/sequence poly- 
morphism/transposon footprints 

Introduction 

The Shrunken {Sh) locus on chromosome 9 of Zea mays encodes 
the enzyme sucrose synthase (EC 2.4. 1 . 13) which catalyzes the 
cleavage reaction of sucrose to UDP-glucose and fructose in both 
directions (Chourey and Nelson, 1976). The .enzyme is involved 
in starch metabolism of the developing endosperm. In maize 
strains homozygous for recessive sh mutations, sucrose synthase 
activity is decreased to 2 -6% and starch content in the mature 
kernels is decreased to 60%. This causes the shrunken pheno- 
type (Chourey and Nelson, 1976; Chourey, 1981). In wild-type, 
the protein encoded by the Sh locus amounts up to 3% of the 
total protein. The active form of the enzyme consists of four ident- 
ical subunits of -88 kd (Tsai, 1974). It is not known whether 
the high residual starch content in the mutants indicates that the 
enzyme is present in large excess in the wild-type or whether 
alternative pathways of starch biosynthesis are used. 

A residual sucrose synthase (B) activity is detected in sh 
mutants (Chourey, 1981; Chourey and Nelson, 1976), including 
a deletion of the Sh gene (Burr and Burr, 1981; Chaleff et at., 
1981; Doring et ai., 1981). This indicates the presence of a 

© IRL Press Limited. Oxford, England. 



second gene. In support of this idea, McCormick et aL (1982) 
detected a mRNA species and DNA fragments hybridizing weak- 
ly to a cDNA clone of the Sh gene in RNA and DNA isolated 
from maize strains homozygous for Sh deletions. The in vitro 
translation product of hybrid-selected mRNA transcribed from 
the sucrose synthase (B) gene has a similar, slightly faster electro- 
phoretic mobility than the in vitro synthesized sucrose synthase 
(A) subunit encoded by the Sh locus and is precipitated by anti- 
serum against sucrose synthase (A) protein (McCormick et al 
1982). 

The two genes seem to be regulated differently. In wild-type 
strains sucrose synthase A activity increases 40-fold during kernel 
development in the endosperm tissue, starting at day 5 - 8 after 
pollination. The enzyme activity reaches a maximum 40 days 
after fertilization and drops afterwards (Chourey, 1981). In strains 
in which the Sh gene is deleted the residual sucrose synthase B 
activity. stays at a low level. Independent of the genotype (Sh 
or sh), only sucrose synthase B is detected in the embyro of the 
developing kernel (Chourey and Nelson, 1976). 

A second reason for the interest in studying expression of 
sucrose synthase genes is the reaction catalyzed by the enzyme. 
Sucrose is the major transport form of assimilate on its way from 
the photosynthesizing leaves to the energy-consuming tissues, 
e.g., to the kernels. The observations from several plants are 
in agreement with the hypothesis that synthesis of sucrose is 
mainly catalyzed by sucrose-6-phosphate synthase (EC 2.4. 1. 14) 
and sucrose phosphatase (EC 3.1.3.24) and that sucrose synthase 
has its major role in sucrose breakdown for respiration or starch 
biosynthesis (Hawker, 1971; Downton and Hawker, 1973; 
Vieweg, 1974; Preiss and Levi, 1980). The enzyme therefore 
may be involved in the utilization of sucrose within different plant 
tissues, and may thus influence where the sucrose is stored or 
broken down, respectively. 

As a step towards the study of gene regulation during plant 
development, we present here the exon-intron structure of the 
Sh gene. In addition, we discuss some evolutionary aspects 
deduced from the comparison of different alleles of the Shrunken 
gene and DNA sequences upstream of the transcription unit which 
might be important for regulation. 

Results 

DNA sequence studies of the Shrunken gene 
For our sequence studies we used one genomic and two cDNA 
clones. A 600-bp 3'-terminal cDNA clone and a 16.3-kb genomic 
clone have been described previously (Geiser et aL, 1980, 1982). 
A 2.572-bp cDNA clone (pWWl 10/1) was isolated from a cDNA 
library kindly provided by A.Gierl and Zs.Schwarz-Sommer, 
Max-Planck-Institut fur Zuchtungsforschung, Cologne. On the 
genomic clone, the whole transcription unit and a DNA segment 
of 1 140 bp located in front of the transcription unit were sequenc- 
ed from both strands. The small cDNA clone was also sequenc- 
ed from both strands, while from the larger cDNA clone only 
one strand was sequenced. Comparison of the two clones yield- 
ed the structure shown in Figure 1, where the cDNA clone 
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Fig. 1. Exon/intron structure of the shrunken gene. The exons are marked by the numbered boxes. The size of the exons is given above the drawing, intron 
sizes below. The total length of the drawing indicates the region which has been sequenced. A partial restriction map of the genomic DNA is shown in the 
upper part. The distribution of the dinucleotide CpG is given in absolute numbers for consecutive segments of 100 bp each. 



(pWW110/l) ends within exon 3. The full sequence of both 
clones is entered in the EMBL DNA library and is available from 
the authors upon request. 

Determination of the transcription start site 
The large cDNA clone is not full length and terminates in exon 
3. This was determined by SI mapping experiments which were 
carried out in order to determine the approximate size and the 
order of the transcription unit and the size of the exons. For these 
experiments, the 7. 1-kb Clal fragment was labeled at the 5' ter- 
mini. This fragment has one end in the central part of the tran- 
scribed region, extends beyond the transcription start and contains 
all upstream sequences present in our genomic wild-type clone. 
Aliquots of the labeled fragments were cleaved with restriction 
endonucleases BamHl, BgH, Xbal, HindUl, Sphl and BgHl. The 
fragments located towards the 3' end of the gene were purified, 
hybridized to mRNA, digested with SI endonuclease, separated 
on agarose gels and subjected to autoradiography. The result of 
such an experiment is shown in Figure 2. As described in the 
sketch in Figure 2b, the longest band in each lane represents the 
full-length protected fragment, while the shorter bands are formed 
by partial SI cleavage opposite to digested intron loops. The suc- 
cessive shortening of the labeled fragments by restriction diges- 
tion should have no influence on the length of the longest bands, 
until the first exon has been (partially) removed from the DNA 
before hybridization. By this method, the first exon is located 
in the 700-bp interval between the Xbal and the HindUl site 
(Figure 1), while the second exon is located between the HinaUl 
and Sphl sites. The third exon is thus located within the central 
3.3-kb BglU fragment. It contains a PvuU site which is also pre- 
sent in the cDNA clone pWWl 10/L The position of the intron 
2/exon 3 boundary was determined by a SI mapping experiment 
with DNA fragments labeled at the 5' terminus of this PvuU site. 
The resulting SI -resistant DNA fragment was 40 bp long, thus 
the Pvull site is located 40 bp downstream of the intron/exon 
boundary (indicated in Figure 3c). The first three exons are thus 
spread over a distance of > 1.5 kb. As we had no cDNA clone 
in this region available, we decided to perform primer extension 
experiments with reverse transcriptase. 
Poly (A) + RNA isolated from developing endosperm was 



reverse transcribed in the presence of the synthetic oligodeoxy- 
nucleotide AAGCATTCCCTTGCCC as primer. The position of 
the primer sequence within exon 3 is indicated in Figure 3c. 
Reverse transcription was done on poly (A) + RNA isolated either 
from maize strains carrying the Sh allele, or as a control from 
maize carrying the deletion sh bz-m4. Reverse transcripts were 
electrophoresed on 6% denaturing polyacrylamide gels. As shown 
in Figure 3, only the wild-type RNA gives rise to a prominent 
double band at position 191/192, while many faint bands are also 
present in the control. Most likely, this strong band is a reverse 
transcript extending to the 5' end of the mRNA. The size of exon 

2 was estimated accurately from SI mapping experiments to be 
115 bp. In these experiments Sl-digested RNA/DNA hybrids 
were denatured and electrophoresed on polyacrylamide gels con- 
taining 7 M urea. After transfer to nitrocellulose and hybridiz- 
ation to radioactively labeled DNA fragments, the length of the 
individual exons could be determined precisely. 

If the known size of exon 2 and the residual bases of exon 

3 towards the 5' end of the synthetic oligonucleotides are added 
and subtracted from the length of the reverse transcript (191/192 
bases), there remain 50 or 51 nucleotides for the size of exon 
1, which is in agreement with the size estimated from SI map- 
ping experiments (Figure 2). 

To determine the position of exons 1 and 2, the primer exten- 
sion experiment was repeated in the presence of dideoxynucleo- 
tides to determine the sequence of the reverse transcript. The 
autoradiographs obtained from this experiment showed a high 
background most likely due to unspecific priming of reverse 
transcriptase as already visible in the sh bz-m4 lane of Figure 3a. 
Therefore only parts of the sequence could be unambiguously 
determined. In general, the sequence was less clear at the higher 
transcript sizes. 

Comparing this limited cDNA sequence information with the 
known genomic DNA sequence, we could recognize —70% of 
exon 2 sequence. The sequence information obtained at the 
boundary between exons 1 and 2 is shown in Figure 3c. The 
unambiguously determined nucleotides extending further than 
exon 2 were screened in a computer search against all DNA 
sequences which were available upstream of exon 2. Even with 
one mismatch allowed, homology was found only once as indi- 
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Fig. 2. (A) The DNA fragments used for hybridization to poly(A) + RNA 
are indicated above the autoradiogram. The DNA fragments carried the 
label at the 5' terminus of the left end. SI -resistant RNA/DNA hybrids 
were electrophoresed on 1.2% agarose gels. (B) A schematic drawing: 
formation of RNA/DNA hybrids, digestion with endonuclease SI and 
possible SI -resistant hybrid molecules. 



cated in Figure 3c. This position is within the interval between 
the Xbal and HindHl site which was determined above (Figure 2). 
The number of nucleotides between the last nucleotide of the 
cDNA that had been determined with certainty and the 3' end 
of the reverse transcript could be determined accurately, because 
random termination of the reverse transcriptase reaction caused 
a ladder of nucleotides in each lane. The distance was found to 
be 34 and 35 nucleotides. No dinucleotide AG, which could serve 
as an intron/exon boundary, is found in this interval within the 
DNA sequence. Thus the experiment places the transcription start 
tentatively, as shown in Figure 3. 

A Hindlll site is located within the first exon, as determined 
by the primer extension experiment. DNA was labeled at this 
Hindlll site and used for a SI protection experiment. Poly(A) + 
RNA obtained from wild-type endosperm protects DNA 
fragments of a length between 32 and 46 bp (Figure 3b). The 
four most pronounced RNA species are located at the positions 
indicated in Figure 3c. This confirms the transcription start site 



determined by the reverse transcription experiment in general, 
but places the major transcription start sites 3 - 6 bp in front of 
the terminus of the reverse transcript. The reason for this is 
unknown. The most prominent signal detected in the SI map- 
ping experiment is defined as transcription start and numbered 
+ 1 in our sequence data. The boundary between exon 2 and 
intron 2 was placed from the known size of exon 2 (1 15 bp) and 
the GT/AG rule (Breathnach and Chambon, 1981). Approximate- 
ly 40 bp upstream and downstream of this splice site (shown in 
Figure 3c) no other GT dinucleotide is found in the genomic 
sequence. 

The insertion sites ofDs in the sh-m5933 and sh-m6233 alleles 
The 4.1-kb insertion in the sh-m6233 allele is flanked by the 
duplication of the 8-bp CTTGTCCC This 8-bp sequence is found 
at positions 193-220 in intron 1. In the sh-m5933 allele (Cour- 
age-Tebbe et al. 9 1983), only the 3'-adjacent sequence of the 
30-kb insertion is known. The 8-bp sequence GTCGCTTT 
located adjacent to the insertion site on the 3' side is located at 
positions 2970-2977 in intron 7. This insertion site is 12 bp 
upstream of the boundary to exon 8. 

Protein coding capacity of the Shrunken mRNA 
The first AUG is found in exon 2 and starts an open reading 
frame of 802 amino acid codons which is terminated by a TAG 
stop codon in exon 16. The predicted mol. wt. of the sucrose 
synthase monomer is 91.731 daltons. The predicted amino acid 
composition has been confirmed by a total hydrolysis of purified 
sucrose synthase protein. 

Discussion 

We have analyzed the transcription unit of the sucrose synthase 
gene and -1.1 kb upstream of the transcription start by DNA 
sequencing. The results are discussed from 5' to 3' following 
the direction of transcription. 

The 5 '-flanking region of the sucrose synthase gene 
We have examined the DNA sequence which extends - 1 kb 
upstream of the starting point of transcription, for sequences that 
might play a role in expression of the gene. Around position 
—610 we find a 16-bp direct repeat 

TGGCGGGG A . GG AA ATA 
TGG . GGGG AGGG A A AT A 
which is separated by 22 nucleotides. A tandem duplication of 
15 bp TTTAGGAAAAAfrTAG is found in front of a zein C 
gene, which is also expressed in the endosperm tissue (Langridge 
and Feix, 1983). This duplication is located between the two pro- 
moters used for transcription of the gene and oriented opposite 
to the direction of transcription. Comparison of the duplicated 
sequences reveals a conserved part GGAAAjA found in front 
of both endosperm genes. The sequence GGAAA found in all 
four repeats is also part of a consensus sequence found in animal 
enhancer elements TGG AAA (Weiher et al, 1983). 

The 22 nucleotides separating the duplicated 16 mers in front 
of the sucrose synthase gene contain a 4-fold repetition of the 
pentanucleotide GGTGG, a fifth copy overlaps the 16 mer dupli- 
cated downstream. It is interesting to note that the trinucleotide 
GTG is present five times within this region. This trinucleotide 
is reported to be frequent in procaryotic and eucaryotic DNA 
sites where interactions with proteins are anticipated for regu- 
lation or recombination (Cheng et al., 1984). 

Short nucleotide sequences have been implicated in the co- 
ordinate expression of eucaryotic genes (Davidson et al, 1983). 
Therefore a possible role of these sequences in the expression 
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Fie. 3. (A) Primer extension experiment using the synthetic oligonucleotide from exon 3 as a primer for AMV reverse transcriptase [see (C)]. (B) Estimation 
of the transcription start by a SI mapping experiment. The Hindlll site shown in C was labeled at the 5' terminus and used for DMA sequencing and the 
RNA protection experiment. The RNA sources used in the experiments were isolated from kernels with Sh phenotype or from kernels homozygous for the 
deletion allele sh bz-m4. (C) Summary of the experiments which determine the transcription start. The DNA sequence (1) shows exon 1 and 2 and parts ot 
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of DNA fragments obtained after SI digestion of RNA/DNA hybrids. The main signal in the SI mapping experiment is designated as major transcription start 
( + 1). 



of the sucrose synthase gene is presently investigated in a heter- 
ologous plant system (Nicotiana tabacum) as well as a transient 
expression experiment in maize protoplasts. 

At position - 124 upstream of the main transcription start 
the sequence CCAAGCT has similarity to the CAAT-box 
(GG^CAATCT, Breathnach and Chambon, 1981). The sequence 
(TATTTATT) located at position -29 resembles a TATA box. 
Both positions are marked in Figure 4. 

Vie structure of the transcription unit 
Most of the gene structure was deduced from comparison of a 
nearly full length cDNA clone to the genomic DNA sequence. 
The DNA sequence of the genomic clone has been determined 
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by sequencing both strands, while only one strand of the cDNA 
clone was analyzed. As the cDNA clone pWWl 10/1 ends in ex- 
on 3 (Figure 1), the position of exon 1 and 2 within the genomic 
sequence and the 5' end of exon 3 were determined by a combi- 
nation of primer extension and SI mapping experiments. At the 
transcription start site the results from these two types of exper- 
iments do not agree completely. The primer extension experiment 
places the transcription start at position +3 and +4. In the SI 
experiment, start sites are found within a 13-bp region, extend- 
ing from -3 to +10. Since these bands are found even after 
extended incubation with SI (data not shown), they seem to really 
reflect different start sites rather than exonucleolytic digestion 
by SI. The two strongest start sites are located two nucleotides 
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- 6 8 1 TGCTATGTTTCTAGAAGGAGGGGAAACCTAATCCCCCTAATCCAATGGCGGGGAGGA^ 

- 58 1 TCTCGCTACTITITAATCCGGACAAGCTCA^^ 

- 4 8 1 GGCTGCTCTCACCAAATCAGCTGCMGAAGAITAGAGCTCAAAAGAATTACAG^^ 

-381 TarixrrccTiTCixrrcccAC^c 

- 2 8 1 AAAAAGGTCTGAACTITICCGAAACCAGCCAGCCATTGGTCT^ 

CAAT-box 

- 1 8 1 ATTTGGAAATTCTGGTTTTCAGTAA^ 
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Fig. 4. Selective DNA sequences. (A) DNA located in front of the transcription start. (B) 3' end of the sucrose synthase gene. The arrow 7 bp in front of the 
poly(A) tail in pWW 110/1 indicates the position where polyadenylation starts in the cDNA clone isolated from W23 x K55. 



upstream from those determined by the primer extension exper- 
iments. This difference may be explained by incomplete exten- 
sion due to the cap structure at the 5' end of the mRNA (Banerjee, 
1980; Nathans and Hogness, 1983). It is not clear why the micro- 
heterogeneity observed in the SI protection experiment is not 
also seen in the primer extension experiment. It is possible that 
the minor bands have not been detected in the primer extension 
experiments due to the high background found in these auto- 
radiographs. Microheterogeneity at the 5' end of the mRNA has 
also been determined for a soybean lectin gene (Vodkin et al., 

1983) as well as for the maize alcohol dehydrogenase gene 
(Dennis et at., 1984). 

The first ATG is found 72 nucleotides downstream of the main 
transcription start within exon 2. It is located within the sequence 
GAGCCATGG, which has extensive homology with other trans- 
lation start sites of eucaryotic genes (CC^CCATGG, Kozak, 

1984) . 

This ATG starts an open reading frame of 2406 bp or 802 
amino acids which predicts a mol. wt. for the protein of 91.731 
daltons. The open reading frame is followed by an untranslated 
region of 269 nucleotides between the terminal TAG and the 
poly(A) addition in our cDNA clone pWWl 10/1 . The exact poly- 
adenylation site cannot be determined, because genomic clone 
and cDNA diverge at the position of two A residues, as is often 
found in polyadenylation sites. 31 nucleotides in front of the 



poly(A) tail the sequence AATAAA is found. This signal is 
usually found in front of the poly(A) addition site (Proudfoot and 
Brownlee, 1976; Proudfoot, 1984). The sequence CAYUG is 
often observed between the AATAAA signal and the polyadenyl- 
ation site (Berget, 1984). The only sequence slightly resembling 
this signal is the pentanucleotide CAGAG located three 
nucleotides in front of the poly (A) tail. 

A cDNA clone isolated from maize line W23 x K55 by Chaleff 
et al. (1981) has a poly (A) tail starting 7 bp upstream of the 
poly(A) addition site in pWWl 10/1 (Sheldon et aL, 1983; C.Han- 
nah, personal communication). The reason for the utilization of 
different polyadenylation sites remains unclear because the cor- 
responding genomic sequences of W23 x K55 and line C are not 
known and might contain differences, which could explain this 
result. The sequence AATAAA is also found three times within 
the primary transcript, in introns 2, 6 and 1 1 (Table I). This con- 
firms the observation by Proudfoot (1984) that AATAAA is not 
sufficient for poly(A) addition to occur or else that some poly(A) 
addition products are unstable and escape detection. 

Exons and introns 

The sizes of the exons and introns are shown in Figure 1 . The 
exon-intron borders are listed in Table I. Only intron 1 (1 1 14 bp) 
and 2 (51 1 bp) are larger than 162 bp, seven of the other introns 
are even shorter than 100 bp. 
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Table I. 



Selected sequences in the transcription unit 

1080 gagccATGc translation start 

1 585 cttcagatctAATAAAaaggatatgag ATGTCCATC 

2592 gtgaatgctcAATAAAacgttctgtacttgctatgg 

3852 tttagtagtaAATAAAactagtatgtgatgttttct 

5401 TAG end of the open reading frame 

Exon/intron boundaries 



49 


GGG 


CTATGCTT . 


. intron 




. AGCTCGAATTGCAG 


TAT 


1177 


CAG 


GTGGGCTT . 


. intron 


2 . 


. TACCACTTCTACAC 


GTA 


1809 


CAG 


GTAACACT . 


. imron 


3 . 


. TTGTCTGCATATAG 


GAA 


2048 


ACA 


GTAAGTTC 


imron 


4 . 


. TCCTII 1 1 1ACCAG 


ATC 


2320 


ACG 


GTGAGCTT . 


. imron 


5 . 


GTTTTCTGTTACAG 


ACG 


2589 


TAG 


GTGAATGC , 


. intron 


6 . 


. ATGATC TGTGTTAG 


GTT 


2904 


CAG 


GTACAAAA . 


. intron 


7 . 


. CAGTCGCTTTGCAG 


GTT 


3075 


ATT 


GTATGTTT 


intron 


8 . 


. CTTATTTGTTGCAG 


GTT 


3344 


GAG 


GTATACAG . 


. intron 


9 . 


. ATTCTGTGCTGCAG 


GAT 


3544 


CAG 


GTCTGTTT . 


. intron 


10 . 


. GTACATACTTGCAG 


TGT 


3820 


AAG 


GTAGAATT . 


. intron 


11 . 


. TGTTGTTTCTGCAG 


CAA 


4125 


CAA 


GTGAGTAT 


intron 


12 . 


. TTACTTGCTTCCAG 


GTT 


4578 


CAG 


GTATATGC 


intron 


13 . 


. TTTTGTGTGGGTAG 


CCT 


4951 


GAA 


GTATGCAT . 


. intron 


14 . 


. TTTGGATTGCTCAG 


GTA 


5252 


CTG 


GTAAGCCG . 


. intron 


15 . 


. TTTCTGGAATCCAG 


GCA 



The numbers in front of the short DNA sequences shown correspond to the 
number of the first nucleotide within our complete genomic sequence 
information. The positions of the potential polyadenylation signals AATAAA 
within the transcribed region are located in introns 2, 6 and 11, 
respectively. 

The borders between exons and introns are in full agreement 
with the GT-AG rule (Breathnach and Chambon, 1981). The con- 
sensus sequence of introns at the donor site (Mount, 1982) is 
well conserved with the exception of position +6, where we 
observe a preference (seven out of 15) of C instead of T. The 
generally less-conserved acceptor site agrees well with the consen- 
sus sequence of published introns of animal genes. 

Introns 1 - 14 carry a stop codon in frame, thus preventing 
translation of the unspliced RNA. Intron 15 is an exception. It 
could be translated in-frame with exons 15 and 16, yielding a 
translation product ending at a stop codon 67 bp in front of the 
AATAAA in a region that is not translated from the mature 
mRNA. Whether this is of biological significance is not known. 

Base composition 

Three contiguous exons, no. 13, 14 and 15, are significantly more 
GC-rich (57, 58 and 60%) than the other exons (51 %). No such 
deviation is seen in the introns, which are in general more AT- 
rich than the exons. Much of the GC excess arises in third codon 
position and is thus not ascribable to the amino acid sequence. 
It will be interesting to see whether the 3-dimensional structure 
of the protein indicates that exons 13-15 form a separate 
domain. In this case, it could be discussed whether the gene is 
composed of two subgenes that have evolved separately. The lack 
of GC excess in the introns might then indicate that these have 
been added to the gene after the evolution of the coding domain. 
The dinucleotide CpG is under-represented in eucaryotic DNA 
(Bird, 1980). The distribution of CpG in the exons is shown in 
Figure 1. Again, exons 13, 14 and 15 show an excess of CpG, 
which is even higher than expected on the basis of the CG-content 
(46 expected/64 observed). 

Codon preferences 

Codon usage in the sucrose synthase gene has been tabulated and 
is available from the authors on request. An interesting deviation 
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Table II. 



Transitions: 


11 


Transversions: 6 


C-T (5), 


T-C (4) 


A^C, G-T, C-G 


A-G, 


G-A 


C-A, T-G, T-A 



Deviations between the DNA sequence of the genomic clone and the cDNA 
clone pWW110/l within the protein coding region. The G to A transition is 
a difference between the genomic clone and the cDNA clone pKS500. 



from a random usage is found for those codons where the second 
base is a T, and where A and G are used synonymously in the 
third position. In these cases, G is preferred strongly over A 
(valine GTG/A: 17/2; leucine TTG/A: 16/0; CTG/ A: 34/3). A 
preference for the TG over TA in second and third codon pos- 
ition is also found in other eucaryotic genes (Beyreuther ex al. , 
1983) but to a lesser extent. The codon preference described 
results in a measurable though numerically smaller preference 
for TG over TA dinucleotides in the exons. No such preference 
is seen in the introns. This might indicate that the selection leading 
to this preference is exerted at the level of translation. It has not 
been reported, however, that these codons are served by different 
tRNA molecules (Sprinzl and Gauss, 1984). 

Evolutionary aspects 

The genomic clone sequenced by us was isolated from one of 
McClintock's strains. Our long cDNA clone pWW110/l was 
isolated from line C which is a corn belt dent. Our short cDNA 
clone pKS500 is derived from the German commercial hybrid 
EDO. The genomic clone described by Sheldon et al (1983) was 
isolated from Black Mexican Sweet. Some sequence information 
from the 3' end of the gene of this clone is available. We could 
thus search for sequence heterogeneities between these maize 
lines. 

A comparison between the genomic and the cDNA sequence 
yields information about the exons. Since our cDNA clone 
(pWW 110/1) is not complete at the 5' end and since we have 
not sequenced that part of it which was already known from the 
cDNA clone pKS500, we can compare a length of 2100 bp of 
exon DNA only. In the exon sequences investigated, we find 16 
base substitutions, both transitions and transversions (see 
Table II). Though the cDNA clone has been sequenced from one 
strand only, we have inspected our gels carefully and could read 
the positions in question unambiguously. Whether the difference 
arises from mistakes of the reverse transcriptase cannot be ex- 
cluded without sequencing another cDNA clone. It is noteworthy, 
however, that all of these differences are located in third codon 
positions and none of them leads to an amino acid substitution. 
This can hardly be explained by random mistakes of the reverse 
transcriptase reaction. Within the 270-bp protein coding region 
of our previously isolated cDNA clone pKS500 (Geiser et al , 
1980) we find one transition in the 3rd codon position, which 
is also silent on the protein level. Miyata et al. (1982) reported 
an accumulation of silent base substitutions in several animal 
■genes of 5.37 x 10~ 9 /year. 

We have compared a sequence of 2100 bp length between two 
maize lines, of which 540 positions potentially lead to silent 
substitutions. As we found 16 differences between the two 
sequences, we arrived at an evolutionary distance of 6 x 10 6 years 
between the two maize lines implying a separation time of 3 x 10 6 
years. If we compare the 3 '-untranslated region of the two sucrose 
synthase isolates mentioned above, we find within 270 bp 10 base 
substitutions (in addition to two deletion/insertion mutations). 
Since the 3 '-untranslated region does not allow the distinction 
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Fig. 5. Comparison of DNA sequences found in the 3 '-untranslated region 
of different alleles of the shrunken locus, which could reflect transposon 
footprints. A possible common precursor allele is shown. 

between silent and other codon positions, we compare all sub- 
stitutions in this sequence of 270 bp with the number of substi- 
tutions in the 540 silent positions within coding sequences. The 
numbers found (16/540 and 10/270) are so similar that we must 
assume that the nucleotide exchanges in 3 '-untranslated regions 
are accumulated at similar rates and are thus no more or less 
conserved than the silent positions within the coding region. 

The calculated distance between the two alleles would mean 
that the two alleles have been separated 3 x 10 6 years ago. This 
is two orders of magnitude longer than the time that has elapsed 
since the domestication of maize (Galinat, 1977). This may either 
mean that Z. mays has not gone through a very small population 
size upon domestication and that thus much of the diversity of 
its progenitors is reflected in its present genotype, or else that 
Z. mays evolves two orders of magnitude more rapidly than other 
species. 

Comparison between these two sequences and those available 
from some other lines (Black Mexican Sweet, Sheldo etal, 1983; 
pKS500, this study; W23 x K55, Chaleff et al, 1981) detects 
a varying degree of divergence ranging from no substitutions 
between pKS500 and isolated from the German line EDO and 
the genomic clone sequenced here to 17 deviations between Black 
Mexican Sweet and line C. These results may indicate a lack 
of selection for the sequence of the 3 '-untranslated region of the 
sucrose synthase gene. 

In any case, it is interesting to note that we find no amino acid 
polymorphisms. It is surprising that a protein as large as sucrose 
synthase (mol. wt. 92 000) does not have amino acids, the re- 
placement of which is neutral enough to be detected as a sequence 
polymorphism. This result is similar, however, to findings with 
the alcohol dehydrogenase gene of Drosophilia melanogaster 
(Kreitmann, 1983) and supports his suggestion that most amino 
acid replacements are not neutral, but are rather selected against. 

Spontaneous mutations in maize are often caused by the 
insertion of transposable elements. The integration of a transposon 
creates a short sequence duplication in the host sequence. In con- 
trast to transposons of other organisms, these duplications per- 
sist upon excision of the transposon. They are, however, often 
mutated during the excision process (Sachs et aL, 1983; Bonas 
etal. 9 1984; Fedoroff et al. , 1983; Week etal., 1984). Mutations 
occurring during the excision process are explained by a hypoth- 
esis advanced by Nevers et aL (1985). 

If the integration of the transposon has occurred in an exon, 
the remaining duplication alters the predicted sequence of the 
revertant protein, if the number of remaining nucleotides is three 
or a multiple thereof. Two such instances have been described 
by Schwarz-Sommer et al. (1985). Proteins found in maize strains 
after the reversion of a transposon-induced mutation are some- 
times altered in their properties (Echt and Schwartz, 1981; 
Dooner and Nelson, 1979; Shure et al., 1983). These alterations 
may be due to the presumed insertions of one or a few additional 
amino acids. 



Schwarz-Sommer etal. (1985) have also compared intron se- 
quences of the Waxy gene of Z. mays obtained from different 
lines. They found perfect or near-perfect sequence duplications 
that could be ascribed to the insertion and subsequent excision 
of transposable elements. They advanced the hypothesis that 
transposon insertions and excisions are frequent events and may 
thus considerably contribute to the evolution of the maize genome. 
We have analyzed the 3 '-untranslated sequences of five different 
Sh alleles and found two duplications of 7 and 3 bp each that 
are present in one and missing in the other strain (Figure 5). The 
two alleles may have evolved from a common precursor. The 
7-bp duplication could be the footprint of an as yet unknown 
transposable element, or it could have been generated from an 
insertion causing a larger duplication of the insertion site by the 
deletion of one or a few base pairs (Bennetzen et al., 1984; 
Doring and Starlinger, 1984). The 3-bp duplication could be the 
result of an En/Spm insertion and subsequent excision (Schwarz- 
Sommer et aL, 1985). 

It must be asked, however, how much these transposon foot- 
prints contribute to the evolution of proteins. It is noteworthy 
in this respect that many of the transposon insertions known have 
occurred within an intron. This is also true for the three ana- 
lyzed transposon insertions in the Shrunken gene. It is conceivable 
that many transposon insertions in exons leave footprints that are 
not comparable with protein functions. Such transposon inser- 
tions will not appear to revert frequently and may thus not be 
recognized as caused by transposable elements. The above- 
mentioned observations about amino acid replacements caused 
by single base substitutions supports this assumption. It should 
be remembered, however, that enzymes found presently may well 
be adapted to their function and that a more rapid evolution may 
have occurred in the past. 

Materials and methods 

Materials 

The genomic clone SSI (Geiser et al., 1982) and the cDNA clone pKS500 (Geiser 
et aL, 1980) were described previously. pWWl 10/1 was isolated from a cDNA 
library prepared by Schwarz-Sommer et al. (1985) and isolated by standard 
methods (Maniatis et aL, 1982). 

Restriction endonucleases were purchased from BRL (Neu-Isenburg), Biolabs 
(Dreieich) or Boehringer (Mannheim). T4 polynucleotide kinase, DNA polymer- 
ase large fragment (Klenow enzyme), alkaline phosphatase and DNase I from 
Boehringer (Mannheim); T4 ligase from Biolabs (Dreieich). Endonuclease SI 
was from Sigma (Munchen). AMV reverse transcriptase was from a preparation 
of J. Beard (Life Science). 32 P-labeled nucleoside triphosphates were purchased 
from Amersham Buchler (Braunschweig). 

Subcloning and plasmid preparation 

Vectors were pBR322 (Bolivar et aL, 1977), pUC9 (Vieira and Messing, 1982) 
and pUR250 (Ruther, 1982), which were used for transformation of Escherichia 
coli K12 strains HB101 or RR1AM15 (Ruther, 1982). Plasmid DNA was prepared 
in small amounts by the alkaline lysis method (Maniatis et aL, 1982) and larger 
quantities by the method of Clewell and Helinski (1969). 

RNA preparation 

Sh RNA was isolated from the German hybrid EDO. The maize line carrying 
the allele sh bz~m4 was obtained from B.McClintock. Immature ears were frozen 
in liquid nitrogen 20 — 22 days after pollination and stored at -70°C. Frozen 
kernels were ground, incubated with proteinase K, phenol/chloroform extracted 
as described by Kloppstech and Schweiger (1976). Poly(A) + RNA was isolated 
by o!igo(dT)-cellulose affinity chromatography (Collaborative Research). RNA 
was stored in 70% ethanol at -70°C. 

DNA sequencing 

The chemical degradation method (Maxm and Gilbert, 1980) was used with modi- 
fications described by Garoff and Ansorge (1981). Ordered deletions were created 
by random DNase I cleavage within the maize insert and a unique restriction 
site in the vector (Frischauf et aL, 190). Deletions with end points at distances 
of 250-300 bp were used for sequencing both strands after labeling by poly- 
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nucleotide kinase or fill in reaction by Klenow enzyme. Some restriction sites 
used for labeling were determined by sequencing DNA fragments spanning these 
sites. 

Preparation of DNA-RNA hybrids and digestion with endonuclease SI 
DNA-RNA hybrids were prepared as described by Berk and Sharp (1979). 
5 - 10 pg of poly(A) + RNA or equivalent amount of tRNA were hybridized to 
50 ng up to 1 pg of DNA fragments within a total volume of 20 pi. Hybridiza- 
tion was carried out overnight at 43°C. Single-stranded nucleic acids were digested 
with 200 U SI endonuclease at 25°C for 30 - 60 min. Reaction was stopped by 
addition of 5 pg tRNA and 2.5 volumes ethanol. 
Primer extension assay and RNA sequencing 

The synthetic oligonucleotide AAGCATTCCCTTGCCC was synthesized by a 
380 A DNA synthesizer (Applied Biosystems) with subsequent purification of 
the 16 mer on a 20% acrylamide gel. The reverse transcriptase reaction was car- 
ried out as described by Hamlyn et al. (1978) with slight modifications. 2.5 pmol 
oligonucleotide were incubated in 50 mM Tris pH 8.3, 150 mM KG, 10 mM 
MgCl 2 , 20 mM DTT, 40 pM dTTP, dGTP. dCTP each and 6 pCi 
[a- 32 P]ATP (400 Ci/mmol) with 2 U AMV reverse transcriptase at 42 °C. After 
15 min 1 pi of chase mix (250 pM dNTP in 100 mM Tris pH 8.3, 1 U AMV 
reverse transcriptase/pl) were added and incubation continued for 15 min. 2 pg 
tRNA were added, nucleic acids were ethanol precipitated and reverse transcrip- 
tion products analyzed on 6% polyacrylamide gels containing 7 M urea. For se- 
quencing four reactions were carried out in the presence of one dideoxynucleotide 
each. 

The computer analyses of DNA sequences were run on a VAX/VMS com- 
puter version 3.5. The computer programs used were developed at the University 
of Wisconsin (Devereux et al, 1984). 
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ABSTRACT Tobacco plants were transformed by using a 
chimeric gene construction, in which a corn sucrose synthase-1 
gene (Sh) promoter was used to direct expression of the 
^-glucuronidase (Gus) reporter gene. Expression of Sh-Gus 
activity in these plants was found to be cell type specific. GUS 
activity was detected only in the phloem cells but not in any 
other cell types of vegetative tissues. In addition, Sh-Gus 
expression was found to be anaerobically inducible in tobacco 
roots. Sh-Gus was also expressed at high levels in the en- 
dosperm tissue of maturing tobacco seeds. We thus demon- 
strated that the corn Sh promoter can direct cell-type-specific 
and inducible expression in a heterologous dicotyledonous 
plant. 



Corn sucrose synthase 1 (Sh) gene, encoding sucrose syn- 
thase 1 isozyme (UDPglucose:D-fructose 2-a-D-glucosyl- 
transferase, EC 2.4.1.13), is one of the few monocotyledon- 
ous plant genes whose biochemical function has been iden- 
tified (1) and whose promoter sequence has been cloned 
(2-4). As sucrose synthase 1 isozyme is the primary enzyme 
of the various sucrose synthase isozymes, it is believed that 
it may play some important physiological role(s). The Sh gene 
was isolated from the corn shrunken-] locus, which is located 
on chromosome 9S (5). Enzyme activity studies with tissue 
extracts have shown that the sucrose synthase 1 isozyme in 
corn is expressed at high levels in endosperm, at a much 
reduced level in root, but not in green tissues or pollen (see 
ref. 6 for review). These results indicate that the Sh gene in 
corn is expressed in a tissue- or organ-specific fashion. 
Physiological studies have shown that Sh gene expression is 
anaerobically inducible in root and shoot tissues of etiolated 
corn seedlings (6, 7). 

Using a chimeric gene construction, in which a corn Sh 
gene promoter is used to control the expression of the 
^-glucuronidase (Gus) gene, we transformed tobacco plants 
and examined in detail the expression of Sh-Gus activity in 
transgenic tissues. The observed cell-specific and inducible 
expression suggests the possible use of the Sh promoter for 
targeting chimeric gene expression to specific tissues in 
transgenic plants. 

MATERIALS AND METHODS 

Construction of Promoter Genes and Vectors. The maize 
sucrose synthase promoter and first intervening sequence 
was isolated from Pvu55, a maize Pvu I genomic fragment in 
pBR327 provided by B. Burr (2). The Pvu I site at -1347, 
relative to the start of transcription, was converted to an Xho 
I site by linker mutagenesis (W. Swain, personal communi- 
cation). A 2430-base-pair (bp) Xho l/Nco I fragment, con- 
taining theS/i promoter, first intron, and translation start site 
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was isolated and inserted into a promoter-less Gus expression 
vector (8). The resulting plasmid, designated pShGus, con- 
tains the Sh promoter from -1347 through the transcription 
start site, the first intervening sequence in the transcribed but 
nontranslated region, and the Sh gene translation start site at 
the Nco I site. pShGus was then cut with Xho I and inserted 
into an Xho I cut pTV4 vector (pTV4 contains a nptJI gene 
for kanamycin selection and Ti border sequences; ref. 9) and 
was designated pTVShGus. 

A soybean rbcs promoter was isolated from pSRS2.1 
provided by R. Meagher (10). The rbcs promoter was altered 
to place an Xho I linker at about -1550 relative to the start 
of transcription and an Nco I site was placed at the translation 
start site at +45. The resulting 1595-bp A7i<? I /Nco I soybean 
rbcs promoter fragment was then inserted into a promoterless 
pCMCHOO vector at the Xho l/Nco I sites to form a soybean 
rbcs-Gus-nos 3' gene fusion designated pCMC2100. pCMC- 
2100 was then cut with Xho I and the entire plasmid was 
inserted into pTV4 as described above. A soybean heat shock 
(hs) promoter was cloned with the gene hs6871 described by 
Schoffl et al. (11). This promoter was cloned by using 
polymerase chain reaction (PCR) primers to amplify the 
promoter fragment from soybean genomic DN A and resulted 
in an Xho I site at about -420 and an Nco I site (+100) at the 
translation start site. The 520-bp Xho l/Nco I promoter 
fragment was transferred to a Gus expression vector and then 
to a transfer vector as described above for pTVShGus. 

Plant Transformation. We generated tobacco plants with 
the promoter expression DNAs described above both via 
Agrobacterium and electroporation. Tobacco stem sections 
were transformed with disarmed Ti plasmid via Agrobacte- 
rium infection as described (9). 

Alternatively, tobacco mesophyll protoplasts were elec- 
troporated by standard cell culture manipulations (12). Spe- 
cifically, we used the following conditions: 10 of plasmid 
DNA per ml, 200 fxg of salmon sperm DNA per ml (sonicat- 
ed), 2 x 10 6 cells per ml, 350 V/cm, 500 /aF, 25 msec using 
a PDS (Prototype Design Services, New York) electropora- 
tion apparatus. Transformed cells or tissues were selected on 
medium containing kanamycin (100 tig/ ml ) and plants were 
regenerated (9). Transformed plants were grown under stan- 
dard greenhouse conditions. Unless otherwise indicated, 2- 
to 3-foot-tall plants were used for assay of cell-type-specific 
expression of GUS activity. 

Histochemical Staining of GUS Activity. Stem, leaf, root, 
and other parts of plant organs were cut from transformed 
plants. Thin (0.3-0.5 mm), cross tissue sections were hand 
cut and submerged in GUS reaction mixture as described 
(13). Tissue sections in GUS reaction buffer were vacuum 
infiltrated at 300 mmHg for 2 min. Tissues were incubated at 
30°C for 1-8 hr, depending on the level of GUS activity 
expressed in test samples. Unless otherwise indicated, Sh- 
Gus tissues were stained for 8 hr, while rbcs-Gus and hs-Gus 
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tissues were stained for 1-2 hr. The reaction was stopped by 
fixing in phosphate-buffered formalin, and chlorophyll- 
containing green tissues were bleached with 75% (vol/ vol) 
ethanol. Cellular location of GUS activity was observed by 
light microscopy. Photographs were taken with a Zeiss 
photomicroscope and Kodak Gold 100 film. 

Anaerobic Induction and Heat Shock Treatment. Anaerobic 
induction of root Sh-Gus activity was performed as follows. 
Ri progeny seeds of Sh~Gus transformed tobacco plants were 
grown for 4 weeks in the greenhouse (20 cm tall). Segments 
of main root (connecting to stem, ~0.5 cm diameter) were cut 
off the plants and sliced longitudinally along the root axis to 
give two corresponding half-root samples. One set of half- 
root from each plant was submerged deep into 50 ml of MS 
medium in a 50-ml conical culture tube. Carbenicillin (100 
/xg/ml) and nystatin (100 units/ml) were added to the MS 
medium (14) to prevent growth of microorganisms. The 
submerged half-roots were vacuum infiltrated (300 mmHg; 2 
min) to provide better anaerobic conditions for the test. The 
second set of half-roots was rinsed twice in MS medium and 
placed with the cut side down in a humidified Petri dish. After 
2 days, tissue sections were cut and GUS activities in the 
corresponding two half-root samples from each plant were 
compared. For heat shock treatment, freshly cut stem and 
leaf segments (1 cm long) were submerged in MS medium (14) 
supplemented with 0.1 M glucose. Stem and leaf tissues were 
then transferred to the same medium preheated to 45°C and 
incubated at 45°C in a water bath for 5 min with occasional 
shaking. Test tissues were then transferred to a 25°C medium, 
held for 10 min, and heat shock treated again as described 
above. Heat shocked and control segments were then held in 
a humidified Petri dish for 16 hr before sectioning and 
assaying for GUS activity. 

PCR Analysis. DNA from ~10 mg of plant tissue was 
extracted, ethanol precipitated, and redissolved. One hun- 
dred nanograms of these crude DNA preparations and 20 
pmol of each of the appropriate primers were used under 
standard PCR conditions in a Perkin-Elmer thermal cycler as 
described (15). 

RESULTS 

A simplified map showing the main features of corn Sh 
promoter, soybean rbcs promoter, and soybean hs promoter 
used in our Gus gene constructions is depicted in Fig. 1. The 
transformed tobacco plants and the K x progeny were grown 
in the greenhouse, various organs were harvested, and thin 
tissue sections were stained for GUS activity and examined 
by light microscopy. 

Fig. 1A shows the expression of GUS activity observed in 
a young root section of Sh~Gus transformed tobacco plants. 
Clusters of cells that were stained blue were localized in 
phloem tissues. No activity was detected in xylem, pith, 
cortex, or epidermal tissues. At high magnification (Fig. IB), 
the cellular localization of GUS activity was found to be 
highly specific. A group of four to six cells residing at the 
center of phloem tissue clusters are highly stained, and the 
immediate parenchyma cells show little or no stain. The 
remaining cell types of the vascular cylinder, including xylem 
fiber, xylem parenchyma cells, pith parenchyma cells, peri- 
cyle, and endodermis cells were not stained. At still higher 
magnification (data not shown), we observed that two to 
three pairs of sieve elements and companion cells were 
localized at the center of each phloem tissue bundle, and both 
cell types were similarly stained with high GUS activity. The 
surrounding parenchyma cells of the phloem tissues, as 
shown in Fig. 2 B and D, express little or no activity. At this 
level of cellular resolution, we thus have used the term 
"phloem cells" (including only sieve elements and compan- 
ion cells) rather than phloem tissues to describe our results. 
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Fig. 1 . (A) Maize Sh-Gus expression cassette. The location of the 
maize Sh promoter up to the translation start site at the Nco 1 site is 
indicated by the open box. The site of transcription initiation is 
indicated by +1. The first intervening sequence (IVS-1), which falls 
in the nontranslated leader sequence, is indicated by the hatched 
box. The Gus gene (solid box, truncated) and the nos 3' poly(A) 
termination region are also shown. For PCR analysis, synthetic 
oligonucleotides were used as primers. The locations of the primers 
are shown by the solid bars (DR140, -305; DR139, +670; DR138, 
+427; DR72, +1176; DR57, +2456; KB71, +2996). The region of the 
Sh-Gus gene amplified by PCR is shown by a dotted line (see also Fig. 
3). (B) Constructs of the upstream region for soybean rbcs and 
soybean hs promoters. Promoter fragments were ligated to the Gus 
coding sequence as transcriptional fusions. 

Fig. 2C shows the expression of GUS activity observed in 
a stem section of Sh-Gus transformed tobacco plant. A 
special feature of tobacco stem is that it contains two groups 
of phloem tissues— namely, the internal and external phlo- 
ems, which are located along the two sides of the xylem. Most 
other dicotyledonous plants only have external phloem. As 
shown in Fig. 2C, GUS activity is detected in both types of 
phloem cells but not in any other cell types of the stem, 
including epidermis, xylem, and ground tissue cells of pith 
and cortex. At higher magnification (Fig. 2D), the cellular 
localization of GUS activity was found again to be highly 
specific. The phloem cells were stained, whereas the adjacent 
phloem parenchyma cells and vascular cambium cells were 
not. Phloem cell-specific expression of Sh-Gus was also 
observed in leaf, flower, and fruit tissues. Fig. 2E shows that 
only phloem cells in midrib and lateral veins of leaf tissue 
expressed high levels of GUS activity, whereas no activity 
was detectable in other leaf cell types. Phloem cell-specific 
expression of Sh-Gus activity was also observed in flower 
and fruit tissues (Fig. 2 F and G, respectively). 

In corn, the Sh gene is expressed at high levels in the 
endosperm of the immature kernal (1, 6). We have tested 
Sh-Gus expression in the endosperm of transgenic tobacco 
plants. Fig. 2 G and H shows that GUS activity could be 
readily detected in the thin endosperm tissue of immature 
tobacco seeds. In dicotyledonous plants, endosperm tissue in 
seed is formed after fertilization as a thin layer of nutritive 
tissue aligned in a concave fashion with the inner seed coat 
(16). The tissue is initially transparent and is later either 
absorbed by the developing embryo or pressed in between 
the cotyledon and seed coat in the mature seed. A tissue 
section was cut longitudinally through a maturing seed (Fig. 
2G). It shows that a high level of Sh-Gus activity is expressed 
in tobacco endosperm tissue, but there is no activity in the 
cotyledon (Fig. 2G). Fig. 2H shows that, in a younger 
immature seed and at high magnification, GUS activity in 
young endosperm is readily detected at the cellular level. To 
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Fig. 2. (Legend appears at the bottom of the opposite page.) 



Genetics: Yang and Russell 



Proc. Natl. Acad. ScL USA 87 (1990) 4147 



better show the endosperm tissue, we cut the immature seed 
longitudinally into halves and gently teased out the embryo 
and cotyledon from endosperm and seed coat tissues. The 
focal plane of Fig. 2H was adjusted close to the bottom of the 
sliced half seed. In this case, the endosperm tissue was two 
to three cell layers thick and was lined with the seed coat. 
GUS activity was detected in small round endosperm cells 
but not in large polygonal seed coat cells. When Sh-Gus 
tobacco seeds were germinated in the dark for 4 days, 
enzyme activity could be detected only in the very fine veins 
extending through the root, hypocotyl, and cotyledon tissue, 
but not in any other tissues of the seedlings (data not shown). 
Hence, the pattern of phloem cell-specific expression of 
Sh-Gus activity, observed for mature transgenic tobacco 
plants, was conserved in young tobacco seedlings. 

The same pattern of cell-specific expression for Sh-Gus in 
transgenic tobacco plants was observed for three indepen- 
dently transformed plants. Similar expression patterns were 
observed whether the plants were produced by electropora- 
tion or by Agrobacterium infection. In addition to regener- 
ated Ro plants, the selfed Ri progeny (five plants for each 
transformant) were similarly tested for their GUS expression 
patterns and the same results were obtained as shown above. 
We therefore demonstrated that GUS expression dictated by 
the maize Sh promoter in transgenic tobacco plants was 
transmitted to the next generation. 

Results showing localized GUS staining (Fig. 2 A-G) 
indicate that the maize Sh promoter directs phloem cell- 
specific expression of the Gus gene in transgenic tobacco 
plants. However, it is important to show that the results are 
not due to differential staining anomalies. For comparison, 
transgenic tobacco plants transformed with the same Gus 
gene but with different plant gene promoters (see Fig. 1) were 
tested under identical conditions for the GUS assay. Fig. 21 
shows that tobacco plants transformed with soybean rbcs 
promoter express high levels of GUS activity in leaf meso- 
phyll cells but not in phloem or other cell types of leaf tissues. 
Fig. 2J shows that, 16 hr after heat shock, tobacco plants 
transformed with the soybean heat shock promoter express 
high levels of GUS activity in virtually all cell types present 
in leaf tissues, including cortex, vascular cambium, xylem 
parenchyma, epidermis, and phloem cells. In stem, vascular 
cambium and phloem cells are apparently more responsive to 
the heat shock treatment than other cell types and were 
readily stained with GUS activity (Fig. 2K; see Fig. 2B for 
comparison). These results confirm that GUS enzyme pro- 
duced in various cell types of tobacco tissues, including 
phloem, vascular cambium, xylem parenchyma, cortex, and 
mesophyll tissues, can be nondiscriminatively detected by 
our GUS activity assay. The experiments confirm that the 
phloem cell expression pattern observed for Sh-Gus trans- 
genic plants is highly specific, is directed by the Sh promoter, 
and is not due to some unknown factors that may differen- 
tially affect activity staining in our GUS assay. 

Expression of the Sh gene in young corn seedlings can be 
induced under anaerobic conditions (6). Springer et al. (7) 
have shown that the level of Sh gene transcripts increases 10 
and 20 times in shoot and root, respectively. In Sh-Gus 
tobacco plants, we have observed that GUS activity is 
anaerobically inducible in root tissue. Using replicate half- 
root samples prepared from the root of the same plants, we 



showed (Fig. 2L) that root GUS activity is clearly induced 
under anaerobic conditions. With the short time period («30 
min) used for GUS staining in this case, only a low level of 
activity was detected in the control (aerobic) root tissues, but 
high levels of enzyme activity were observed for the anaer- 
obically treated half-root tissues. The expression of GUS 
enzyme remained phloem specific. When six Ri progeny 
from the same transgenic parent were tested for anaerobic 
induction of Sh-Gus expression in root, four responded with 
strong induction (similar to Fig. 2L). The other two plants 
showed little observable differences in GUS activity in half- 
root samples that were treated with or without anaerobic 
induction conditions, suggesting that these plants were only 
weakly induced above the basal expression level found in 
roots. Overall, our results suggest that Sh-Gus activity in root 
phloem cells of transgenic tobacco plants is anaerobically 
inducible and the Sh promoter is apparently responsible. 

PCR analysis was performed to provide molecular evi- 
dence that an intact Sh-Gus gene was introduced into tobacco 
plants. Fig. 1 indicates the positions of the primers used to 
detect the presence of each of the regions of the Sh-Gus gene. 
Primers DR140 and DR139 detected a 975-bp fragment span- 
ning the transcription start site from -305 in the promoter to 
+670 in the nontranslated intron. Primers DR138 and DR72 
detected a 749-bp region from within the intron to the 5' 
coding region of the Gus gene. Primers DR57 and KB71 could 
detect a 540-bp region from the 3' end of the Gus coding 
region into the nos 3' terminator. These three sets of PCRs 
demonstrated that the promoter, intron, and Gus gene were 
intact. Fig. 3 shows the PCR results for two transgenic plants. 
Tobacco plant 3487 was produced via Agrobacterium infec- 
tion, while tobacco plant 2713-14 is the Ri progeny of plant 
2713, which was produced via electroporation. The data 
demonstrated that transgenic plants recovered from different 
transformation methods contain intact genes and show the 
same expression patterns. 

DISCUSSION 

In this report, we demonstrated that the corn Sh gene 
promoter can direct highly specific, phloem cell-type expres- 
sion of the Gus gene in transgenic tobacco plants. Sh-Gus was 
also found to express at high levels in endosperm tissue of 
immature tobacco seeds. In addition, the corn Sh promoter 
could be anaerobically induced in phloem cells of tobacco 
root tissue. Thus, the Sh promoter can target phloem cells as 
the tissue for foreign gene expression in genetically engi- 
neered plants. As phloem tissues of various plant organs are 
the target tissues for many plant pathogens, highly specific 
phloem expression of pathogen-resistant gene(s) via the Sh 
promoter may be utilized in disease control. It can also be 
used for anaerobic induction of foreign genes in root phloem 
cells. 

Springer et al (7) have demonstrated that expression of Sh 
gene at the transcriptional level is readily detectable in shoots 
of etiolated corn seedlings (including first and second leaves). 
A very low, but detectable, transcript level is found in mature 
green leaves. Upon anaerobic stress of young seedlings, the 
level of Sh transcripts increases 10-20 times in shoot and 
root, respectively. Our results in transgenic tobacco suggest 
that the expression of the Sh promoter is best described not 



Fig. 2 (on opposite page). Histochemical localization of GUS activity in transgenic tobacco tissues. Expression of Sh-Gus gene in tobacco 
tissues: (A) Root at low magnification. (x4.) (B) Root at high magnification. (xl5.) (C) Stem. (x4.), (D) Stem. (xl5.) (£) Leaf midrib and 
mesophyll (Inset, part of a whole leaf). (F) Petal and ovary base. (G) Fruit and seed endosperm. (//) Immature seed endosperm. Expression 
oirbcs-Gus gene in tobacco tissue: (/) Leaf midrib and mesophyll. Expression of hs-Gus gene in tobacco tissues: (J) Leaf. (K) Stem. Anaerobic 
induction of Sh-Gus in root tissue of transgenic tobacco: (L) Half-root samples. (Upper) Anaerobic roots. (Lower) Control (aerobic roots), p, 
Phloem; ip, internal phloem; ep, external phloem; pp, phloem parenchyma; c, cortex; pi, pith; x, xylem; xp, xylem parenchyma; xf, xylem fiber; 
e, epidermis; v, vascular bundles; vc, vascular cambium; ed, endodermis; mr, midrib; lv, lateral vein; m, mesophyll; en, endosperm; sc, seed 
coat layers. 
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Fig. 3. PCR analysis to detect the Sh-Gus gene in transgenic 
plants. PCR was used to show that intact genes were transferred to 
the transgenic tobacco plants. Lanes 1-3, PCRs containing primers 
DR140 + DR139 and genomic DN A from Tob 3487, Tob 2713-14, and 
Havana 425 (nontransformed control), respectively. Lanes 4-6, 
reactions containing DR138 + DR72 and genomic DNA from Tob 
3487, Tob 2713-14, and Havana 425, respectively. Lanes 7-9, reac- 
tions with DR57 + KB71 and genomic DNA from Tob 3487, Tob 
2713-14, and Havana 425, respectively. Outer lanes are size markers. 

at the organ level but at the cellular level. The Sh promoter 
is specifically expressed in phloem cells and it is the relative 
abundance of phloem cells in different organs that contributes 
to the organ-specific pattern one would observe by measuring 
Sh promoter activity in leaves, shoots, or roots. If one takes 
into account the relative abundance of phloem cells in these 
various organs, the pattern of expression we see in this 
system appears to be consistent with the expression observed 
in corn by Springer et al (7). As shown in Fig. 2£, Sh-Gus 
expression is observed in leaf phloem cells. Since these cells 
account for <5% of the midrib cells and 1% of the leaf blade 
cells (estimated by microscopy), the leaf as an organ would 
be expected to show only low levels of Sh promoter expres- 
sion. Therefore, when RNA, protein, or enzyme specific 
activities are measured from whole organ extracts, highly 
specific cell-type expression of a test gene could be drasti- 
cally reduced or become nondetectable. Furthermore, effi- 
ciencies for tissue extraction are often found to vary consid- 
erably among different plant organs, this could further inter- 
fere with the biochemical assays for gene expression that is 
highly specific at the tissue or cell level. 

Rowland et ai (17) showed very recently that the Sh gene 
transcripts are expressed in a cell-type-specific manner in 
corn root tissues. In their study, only the root tissues were 
tested. Expression was detected mainly in the vascular 
tissues, to a lesser extent in pith and epidermis, and not at all 
in cortex tissues. We have demonstrated that the Sh-Gus 
activity in transgenic tobacco is expressed only in the phloem 
cells of vascular tissues, but not in xylem parenchyma, 



vascular cambium, or any other cell types of vegetative 
tissues . The differences observed for the Sh promoter expres- 
sion in the roots of corn and tobacco may be due to differ- 
ences inherent in the expression of the corn Sh promoter in 
the two species. Alternatively, they may be due to differ- 
ences in the resolution of the different assay systems used. A 
direct comparison using the same assay systems or access to 
Sh-Gus transgenic corn would address these differences. 

At present, the physiological role(s) of the sucrose syn- 
thase enzyme(s) in vegetative tissues of plants is not clear (7). 
Our observation on cell-type expression of the Sh promoter 
suggests that a physiological role of sucrose synthase activity 
may be specifically associated with the phloem tissues of 
plants— e.g., sucrose loading and transport through the 
phloem cells in vascular plants. 

The fact that the Sh-Gus gene is effectively expressed in 
tobacco phloem cells suggests that both the transcriptional 
and translational apparatus in tobacco phloem are capable of 
recognizing the signals of corn Sh promoter and its intron 
sequences and can effectively utilize them to dictate the 
induction and cell-type-specific expression of the Gus coding 
sequences. 
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clone and Dr. W. Swain for subcloning the Sh promoter fragment. 
We also thank Jim Fuller, Barry Cohen, and Joe Burkholder for 
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ABSTRACT Carotenoids are orange, yellow, or red photo- 
protective pigments present in all piastids. The first carotenoid of 
the pathway is phytoene, a colorless compound that is converted 
into colored carotenoids through a series of desaturation reactions. 
Genes coding for carotenoid desaturases have been cloned from 
microbes but not from plants. We report the cloning of a cDNA for 
pdsl, a soybean (Glycine max) gene that, based on a complemen- 
tation assay using the photosynthetic bacterium Rhodobacter cap- 
sulatus, codes for an enzyme that catalyzes the two desaturation 
reactions that convert phytoene into ^-carotene, a yellow caro- 
tenoid. The 2281-base-pair cDNA clone analyzed contains an open 
reading frame with the capacity to code for a 572-residue protein 
of predicted M T 63,851. Alignment of the deduced Pdsl peptide 
sequence with the sequences of fungal and bacterial carotenoid 
desaturases revealed conservation of several amino acid residues, 
including a dmuckotMe-binding motif that could mediate binding 
to FAD. The Pdsl protein is synthesized in vitro as a precursor that, 
upon import into isolated chloroplasts, is processed to a smaller 
mature form. Hybridization of the pdsl cDNA to genomic blots 
indicated that this gene is a member of a low-copy-number gene 
family. One of these loci was genetically mapped using restriction 
fragment length polymorphisms between Glycine max and Glycine 
soja. We conclude that pdsl is a nuclear gene encoding a phytoene 
desaturase enzyme that, as its microbial counterparts, contains 
sequence moths characteristic of flavoproteins. 



Carotenoids are photoprotective pigments present in all 
photosynthetic and many nonphotosynthetic organisms. Ab- 
sorption of visible light and photoprotection are mediated by 
a chain of conjugated double bonds, the chromophore, that is 
formed by successive desaturations of the colorless precur- 
sor phytoene (1, 2). Deduced peptide sequences of several 
microbial carotenoid desaturases have been reported (3-6). 
A maize gene (y/) involved in carotenoid biosynthesis has 
been cloned (7), but at present it is not clear whether this gene 
is regulatory or structural. 

Carotenoids are precursors of abscisic acid (8) and carot- 
enoids influence chloroplast differentiation. Abscisic acid 
and chloroplast differentiation affect nuclear gene expression 
(9-11). Thus, a knowledge of the genes of the carotenoid- 
abscisic acid pathway is required for our understanding of the 
interactions between plastid differentiation and nuclear gene 
expression during plant development. 

Norflurazon is a bleaching herbicide that inhibits the desatu- 
ration of phytoene (12). To isolate a soybean phytoene desatu- 
rase gene, we have used a heterologous probe from a gene 
responsible for norflurazon resistance in the cyanobacterium 
Synechococcus PCC7942 (13). We reasoned that since norflura- 
zon inhibits the desaturation of phytoene, the Synechococcus 
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gene could code for phytoene desaturase and could be used as a 
heterologous probe for cloning the corresponding plant genes. 
We report here the cloning and characterization of a soybean 
cDNA coding for phytoene desaturase. § 

MATERIALS AND METHODS 
Recombinant DNA Techniques. Cloning steps were de- 
scribed in ref. 14. DNA restriction endonucleases, polymer- 
ases, and ligases were used under conditions recommended 
by suppliers (New England Biolabs, Bethesda Research 
Laboratories, and Perkin-Elmer/Cetus). The PCR was used 
to amplify a portion of the Synechococcus PCC6301 genomic 
DNA using the primers 5'-TTCTTCGGTGCCTAC- 
CC(C,T)AA(C,T)AA(C,T)ATG and 5 ' -G A AC AATTTT- 
TCAAT(T,C)TC(G,A)GCCAT corresponding to the protein 
sequences FFGAYPNM and TAEIEKLF of the Synecho- 
coccus PCC7942 protein that confers resistance to norflura- 
zon (13). Hybridization conditions for library screening were 
as described in Berlyn et al. (15). The plasmid pNFPDll was 
constructed by ligating a 1857-base-pair (bp) Sst I fragment of 
pdsl cDNA into the Sst I site of pNF3 (16). 

Complementation of Rhodobacter (Rb.) capsulatus and Ca- 
rotenoid Analysis. Plasmid pNFPDll was mobilized by con- 
jugation (17) into Rb. capsulatus BPY69. Ammonia-free 
minimal medium was used for induction of the nif promoter 
(16). Rb. capsulatus cells were grown photosynthetically in 
liquid medium and carotenoids were extracted (18) and 
saponified (19). The solvent used for thin layer chromatog- 
raphy was 5% (vol/vol) acetone in petroleum ether. 

Chloroplast Import. Linearized plasmid templates were 
transcribed using T7 RNA polymerase (20). In vitro transla- 
tions were performed using [ 35 S]labeled methionine (1100 
Ci/mmol; 1 Ci = 37 GBq) and rabbit reticulocyte lysate 
(Promega) according to the vendor's protocol. Translation 
reactions were terminated with 2x import buffer (20), con- 
taining 60 mM unlabeled methionine (20). Chloroplasts were 
obtained (21) from 10- to 12-day-old pea seedlings (Pisum 
sativum). Import was as described in ref. 21, but ice instead 
of HgCl 2 was used to stop reactions. To distinguish between 
bound and imported polypeptides, piastids were treated with 
thermolysin (20). Controls were treated identically but re- 
ceived no protease. Intact piastids were repurified by cen- 
trifugation through Percoll (22), washed once with import 
buffer (containing 5 mM EDTA), and then subjected to lysis 
by resuspending pellets in 250 pX of 10 mM Hepes-KOH, pH 
8/5 mM EDTA. Total membranes (thylakoids plus enve- 
lopes) were recovered by centrifugation at 47,000 x g for 30 
min. The membranes were washed once and resuspended in 
gel sample buffer (20). The stromal fraction was recentrifuged 
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i GAATTCCTTCTACGTACTGCCGTGGTGCTTTCACCACTGCTTACCACTAA 

51 CCTTCCTCTCTCTCTCTGCCGCTGCAAGCTTGGTACTCTCAACTCAATTC 
101 TCCACCTTATTCTTTTCACTTCTTCAGCTCTTGTTTTTTCCCAAATCTAC 
151 TTTCAAAGTGCCTGAATTCTGCAACAGTAATATTAACACTCCTCTCTTTT 
201 GTTCAGGCTTTATTTCCCCAATGGCCGCTTGTGGCTATATATCTGCTGCC 

x MAACGY I SAA 

251 AACTTCAATTATCTCGTTGGCGCCAGAAACATATCCAAATTCGCTTCTTC 

11NFNYLVGARNISKFASS 
301 AGACGCCACAATTTCGTTTTCATTTGGCGGGAGCGACTCAATGGGTCTTA 

28DATI SFSFGGSDSMGLT 
351 CTTTGCGACCCGCTCCGATTCGTGCTCCTAAGAGGAACCATTTCTCTCCC 

45LRPAP IRAPKRNHFSP 
401 TTGCGTGTCGTTTGCGTCGATTATCCACGCCCAGAGCTCGAAAACACCGT 

61LRVVCVDYPRP ELENTV 
451 TAATTTCGTTGAAGCTGCTTACTTGTCTTCCACCTTTCGTGCTTCTCCGC 

78NFVEAAYLSSTFRASPR 
501 ^TrrTrTaaAftrrrTTGAACATCGTTATTG CCGGTGCAGGATTGGCTGGT 
95 pLKPLNIii [X] A | fi & fi L A fi| 
551 TTATCAACTGCAAAATATTTGGCTGATGCTGGGCATAAACCTATATTGCT 
lllLSTiKYL^DAfiHKPI L {T 
601 GGAAGCAAGAGACGTTCTAGGJTGGA^AGGJTTGCTGCATGGAAAGACAAGG 
128 £ Al R Ii V L IS fi & v 1 A A W K D K D 
651 'ATGGAGACTGGTACGAGACAGGCCTACACATCTTTTTTGGGGCTTACCCT 
145 GDWY£TfiLHI FFGAYP 
701 TATGTGCAGAACCTTTTTGGAGAACTTGGCATTAATGATCGGTTACAATG 
161YVQNLFGELGINURLQW 
751 GAAAGAGCATTCTATGATTTTTGCTATGCCAAATAAGCCTGGAGAGTTTA 
178KEHSMIFAMPHKPGEFS 
801 GTCGATTTGATTTTCCTGAAGTTCTTCCCTCCCCATTGAATGGAATATGG 

195 RFfiFPEVLPSPLNGIW 
851 GCAATATTGAGGAACAATGAGATGCTTACATGGCCAGAGAAAGTAAAATT 
211AILRNNEMLTMPEKVKF 

901 TGCAATTGGGCTTCTCCCAGCTATGCTTGGCGGACAGCCATATGTTGAGG 

228 A I G L LP AM LGGQP YV EA 

951 CTCAAGATGGTCTTTCTGTTCAAGAATGGATGAAAAAGCAGGGCGTACCT 

245 QDGLS2QEWMKKQG[V]P 
1001 GAACGGGTAGCTGATGAGGTGTTCATAGCAATGTCAAAGGCACTAAACTT 

261 ERVADEVFIAMSKALNF 
1051 CATCAATCCTGATGAACTTTCAATGCAATGTATATTGATTGCTTTAAACC 

278 INPDELSMQCILIALNR 
1101 GATTTCTTCAGGAGAAACATGGTTCTAAGATGGCCTTTTTGGATGGCAAT 

295 F1QEKHGSKMAFLDGN 
1151 CCACCCGAAAGACTTTGTATGCCAATAGTTGATTATATTCAGTCCTTGGG 

311PPERLCMPIVDYIQSLG 
1201 TGGTGAAGTTCATCTAAATTCGCGCATTCAAAAAATTGAGCTAAATGATG 

328 GEVHLNSRIQKXELNDD 
1251 ATGGAACGGTGAAGAGCTTCTTACTAAATAATGGGAAAGTGATGGAAGGG 

345 GTVKSFLLNNGKVMEG 
1301 GATGCTTATGTGTTTGCAACTCCAGTGGATATTCTGAAGCTTCTTCTACC 

361DAYyF&TPVDILKLiI,P 
1351 AGATAACTGGAAAGGGATTCCATATTTCCAGAGATTGGATAAATTAGTTG 

378 D N wEJg I P Y F Q R[L]D K I V G 
1401 GCGTCCCAGTCATAAATGTTCACATATGGTTTGACAGAAAACTGAAGAAC 

395 vPVINVHIWFDRKLKN 
1451 ACATATGATCACCTTCTCTTTAGCAGAAGTCCCCTTCTGAGTGTATATGC 

411 T Y D If] L L F S R S P L L S V Y A 
1501 TGACATGTCAGTAACTTGCAAGGAATATTATAGCCCAAACCAGTCAATGT 

428 DMSVTCK£YE]iPNQSML 
1551 TAGAGTTGGTTTTTGCACCAGCCGAAGAATGGATTTCACGTAGTGATGAT 

445 ELVFAPAEEWISRSDD 
1601 GATATTATTCAAGCCACGATGACTGAGCTTGCCAAACTCTTTCCTGATGA 

461DIIQATMTELAKLFPDE 
1651 AATTTCTGCAGACCAAAGCAAAGCAAAGATTCTCAAGTACCATGTTGTTA 
478 I SADQSj£&K'I LKY HVVK 
1701 AAACACCAAGGTCGGTTTACAAAACTGTTCCAAATTGTGAACCTTGTCGA 

495 xpRSVYKTVPNCBPCR 
1751 CCCATTCAAAGATCTCCTATAGAAGGTTTCTATTTAGCTGGAGATTACAC 
SlltlQRSPIEGFYL&fiDYT 

1801 AAAACAAAAATATTTAGCTTCAATGGAAGGCGCTGTTCTTTCTGGGAAGC 
528 KQKYLASMEGAVLSGKL 

1851 TTTGTGCACAGGCTATTGTACAGGATTCTGAGCTACTAGCTACTCGGGGC 
545 CAQAIVQDSELLATRG 

1901 CAGAAAAGAATGGCTAAAGCAAGTGTTGTGTAACAAAAACAAGAATTGAA 
561 QKRMAKASVV* 

1951 AGAGTCATGGTAGAGTACAGGAGCATCATTTCAACTTTGGCATTCTTTGT 

2001 CTGTGGTCAGGACTCAGGAGACCTTCAACTTTATTAGTTCATACGAATAA 

20 51 AGAAAGGCTCAGCTTCTGAAATTTAGCTGCACCGTCGTCAACTGTGTGCA 

2101 ATAAGCTATACGGAACAAACGACATGTGTCAACTTTAAGTCAGCCCATTG 

2151 TTTTGTTATCCTCCAATTTTCTGGATCAATGTTTGTATTGGAAAGAAATA 

2201 TGTCATTATTCAAACTTGTTTATATCCACTTTTTTTATTTATCAACATTT 

2251 GTCACAACCTTTCGTTGAGTAAAAAAAAAAAAAAAAAGAATrC 



Fig. 1. Nucleotide sequence of the pdsl cDNA coding strand and 
deduced amino acid sequence of the phytoene desaturase protein. 
Nucleotides are numbered in the 5' to 3' direction. The coding region 
starts at nt 221. EcoRl sites (GAATTC) flanking the pdsl cDNA are 
italicized. Amino acid residues with a match score of at least 3 in the 
phytoene desaturase pattern (see Fig. 2) are underlined. Residues 
showing perfect conservation (match score = 5) are shown in bold- 



at 140,000 x g for 30 min and the resultant supernatant was 
removed for SDS/PAGE. This second centrifugation step 
greatly improved the analysis of "stromal" carotenoid de- 
saturase, presumably by the partial removal of ribulose 
1,5-bisphosphate carboxylase. Samples were subjected to 
SDS/PAGE on 10% gels. Gel sample solubilization, electro- 
phoresis, and fluorography were carried out as described 
(20). 

RESULTS 

cDNA Cloning, Nucleotide Sequence, and Predicted Protein 
Sequence. A radiolabeled Synechococcus PCC6301 probe 
was used to screen 5 x 10 5 plaques of a soybean immature 
cotyledon bacteriophage A cDNA library. Two positive 
clones isolated showed inserts of similar size (data not 
shown). A plasmid (pSoysk) derived from one A clone was 
purified, and both strands were sequenced by the dideoxy- 
nucleotide termination method (23). The 2281-bp insert con- 
tains a single open reading frame that stops at nucleotide (nt) 
1931, a TAA codon (Fig. 1). The first initiation codon is 
located at nt 221. Thus, if this ATG is the initiation codon, the 
cDNA has 200 bp of 5' untranslated sequence, a 1710-bp 
coding region, and 322 bp of untranslated 3' sequence, 
followed by a poly (A) tail. 

The assignment of the ATG codon at nt 221 as the start 
codon of pdsl is supported by three lines of evidence. (/) This 
codon occurs within the sequence CCCAATGGC, which 
approximates the plant ribosome-binding consensus se- 
quence (25). (//) As shown below, this cDNA encodes a 
precursor protein (prPdsl) that is active in an in vitro chlo- 
roplast import reaction, suggesting that all pdsl 5' sequences 
coding for the prPdsl N-terminal region required for import 
are present in this cDNA. (iii) Based on primary sequence 
conservation (see below), we calculate that the prPdsl transit 
peptide has about 94 amino acid residues. If the ATG at nt 221 
is not the start codon, the transit peptide would exceed 170 
residues, which seems unlikely (26). 

The pdsl open reading frame encodes a 572-amino acid 
protein, with an estimated molecular weight of 63,851. The 
322-bp untranslated 3' region contains two putative poly (A) 
addition signals at nt 2046 (AATAAA) and nt 2100 (A AT A A). 
However, these sequences are located further upstream of 
the polyadenylylation site than the usual 15-40 nt found in 
many plant genes (27). t 

Primary Sequence Conservation Among Prokaryotic and 
Eukaryotic Carotenoid Desaturases. To identify amino acid 
residues that, by virtue of being conserved among different 
carotenoid desaturases, could be important in structure and 
function (28), we applied a local alignment algorithm (ref. 29; 
Fig. 2A) to the deduced peptide sequences of four carotenoid 
desaturases, Rb. capsulatus Crtl and CrtD, Neurospora 
crassa AM, and soybean Pdsl. The resulting pattern for the 
family contains 501 characters with 17 conserved residues 
(Fig. IB). The alignment "tree" (Fig. 2C) shows that Crtl and 
Al-1 are the most closely related sequences (match score = 
215.27), followed by CrtD (match score = 142.29) and Pdsl 
(match score = 80.13). Pdsl amino acid residues conserved 
with a match score of 3 or higher are shown in Fig. 1. From 
the N terminus the first region of conservation has the 
sequence Gly-Xaa-Giy-Xaa 2 -Gly-Xaa 3 -Ala-Xaa< r Gly. This is 
characteristic of dinucleotide-binding folds that mediate 
binding of FAD or NAD(P) (31). This motif has been ob- 
served (5, 24) at the N-terminal region of microbial carot- 
enoid desaturases. Extensive conservation has been reported 
between carotenoid desaturase residues that mediate FAD 
binding in glutathione reductase and dihydrolipoamide de- 
face type. Putative FAD-binding residues (24) are boxed. Amino acid 
residues are shown in the standard one-letter code. 
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Fig. 2. Primary protein sequence pattern from microbial and 
plant carotenoid desaturases. (A) Amino acid class hierarchy used to 
construct the pattern. Uppercase characters indicate standard one- 
letter code; lowercase characters, amino acid classes; X, any amino 
acid. Match scores are indicated on the right (modified from ref. 29). 
(B) Carotenoid desaturase similarity pattern, based on Rb. capsula- 
rs CrtI (3) and CrtD (30), N. crassa Al-1 (4), and soybean Pdsl. g, 
Gap character. (C) Dendrogram generated, including overall match- 
ing scores. For this alignment we changed CrtD at position 13 from 
R to G assuming a crtD223 mutation at the corresponding codon (24). 

hydrogenase, two disulfide oxidoreductases (24). Based on 
similar comparison of Pdsl to glutathione reductase and 
dihydrolipoamide dehydrogenase, we identify 19 Pdsl resi- 
dues that could play a role in FAD binding (Fig. 1). 

In Vitro Import of Pdsl into Pea Chloroplasts. In all micro- 
bial carotenoid desaturases analyzed, the dinucleotide bind- 
ing fold is located near the N terminus (5, 24). This plus the 
facts that these proteins are encoded by nuclear genes and are 
targeted to plastid envelopes (2, 32) suggest that the soybean 
Pdsl protein is initially synthesized as a higher molecular 
weight precursor, possessing a transit peptide that targets the 
protein to plastids. To test this we conducted in vitro chlo- 
roplast import experiments. Translation of a pdsl transcript 
resulted in the synthesis of a polypeptide (Fig. 3, lane T) 
whose apparent molecular mass in SDS/PAGE gels (=59 
kDa) is in reasonable agreement with the size of the precursor 
predicted from the DNA sequence. 




30- 



- + - + - + - + 

protease 



Fig. 3. Import of Pdsl into chloroplasts. Import reactions were 
carried out in the presence (lanes 1-4) or absence (lanes 5-8) of 5 mM 
ATP. The latter reactions also contained nigericin (0.26 /iM) and 
valinomycin (0.88 pM) to completely inhibit the production of ATP 
by photophosphorylation (33). After import, the plastids with (+) or 
without (-) protease treatment were fractionated to yield total 
membranes (lanes 1, 2, 5, and 6) and membrane-free stroma (lanes 3, 
4, 7, and 8). A photograph of the SDS/PAGE fluorogram is shown. 
All lanes received an equivalent amount of chloroplasts. Lane T 
contains translation products used for import. 

The radiolabeled polypeptide was then used for in vitro 
import into isolated pea chloroplasts. After import, intact 
chloroplasts were recovered and treated with protease, and 
membrane and soluble plastid fractions were isolated and 
analyzed by SDS/PAGE (Fig. 3). When import experiments 
were conducted in the presence of ATP (lanes 1-4), 35 S- 
labeled prPdsl entered the chloroplasts (as judged by prote- 
ase resistance) and was processed to a smaller mature form. 
Surprisingly, imported mature Pdsl was detected in both 
soluble and membrane fractions. The former could represent 
either stromal material or soluble material liberated from the 
envelope intermembrane space. Although the mobility in 
SDS/PAGE is similar for soluble and membrane-associated 
Pdsl, at present we do not know whether any differences 
exist in primary sequence between these forms. Regardless, 
no protease-resistant polypeptides were observed in any 
chloroplast fraction when import experiments were con- 
ducted in the complete absence of ATP (Fig. 3, lanes 5-8). 
Thus, like other precursor polypeptides targeted to chloro- 
plasts, the import of prPdsl is energy dependent (33). 

Complementation of a Phytoene- Accumulating Rb. capsulatus 
Mutant by pdsl. To determine whether the cloned cDNA indeed 
codes for a phytoene desaturase, we constructed a plasmid that 
incorporates the pdsl region coding for the mature protein 
under the control of bacterial transcription and translation 
signals. The resulting plasmid pNFPDll (Fig. 4) contains the 
promoter, ribosome-binding sequence, and first 25 codons of 
Rb. capsulatus nifH fused in-frame to pdsl codon 73. The nif 
(nitrogen fixation) promoter can be induced by depriving cells 
of a fixed nitrogen source (16). Plasmid pNFPDll was mobi- 
lized by conjugation into Rb. capsulatus BPY69, a strain with 
a genetic lesion in crtl, the gene that codes for phytoene 
desaturase (18). When BPY69(pNFPDll) cells were grown in 
minimal medium without fixed nitrogen, cultures turned yellow 
and, when cells were grown in ammonia-supplemented me- 
dium, they remained blue-green, the color of BPY69 cells 
lacking the soybean cDNA clone (data not shown). Pigments 
were extracted and analyzed by absorption spectroscopy (Fig. 
4B). The yellow pigment in BPY69(pNFPDll) cells grown in 
ammonia-free medium was identified as 15-cw-£-carotene, in- 
dicating that Pdsl mediates two successive desaturations from 
phytoene (Fig. AD). The identity of the compound was con- 
firmed by TLC analysis, where it was shown to comigrate with 
tomato fruit f-carotene (data not shown). 

Based on its structure and function, we assign this gene the 
name pdsl (phytoene Jesaturase 1). 

Genetic Mapping of pdsl. The pdsl cDNA was hybridized 
to blots containing Glycine max and Glycine soja DNA 
digested with 12 restriction enzymes (data not shown). The 
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Fig 4 Complementation of a Rb. capsulars carotenoid mutant 
by the pdsl cDNA. (A) Construction of the nif expression plasmid 
pNFPDll. Open rectangle, Rb. capsulars nif sequences up to mfH 
codon 25 and including the nif promoter (nifP) and translation 
initiation signal of pNF3 (16); solid boxes, pdsl coding sequences 
fused to m/DNA at the Sst I site (GAGCTC, codon 73). The first five 
pdsl codons are underlined. Wavy lines represent vector DNA The 
EcoRl site is located at the 3' end of the pdsl cDNA (see Fig. 1 . (B) 
Absorption spectra in petroleum ether of carotenoids accumulated 
by Rb. capsulars BPY69 (pNFPDll) grown in minimal medium with 
(N + ) or without (N") ammonia. Absorption maxima are indicated for 
each peak. Phytoene and f-carotene accumulate under N and N 
conditions, respectively. (C) Desaturations mediated by Pdsl. Car- 
bon atoms at which desaturation takes place are numbered. 

patterns obtained with EcoRI, BamHh and Hindlll suggest 
that pdsl is a member of a low-copy-number nuclear gene 
family (Fig. 5). Polymorphisms detected with EcoRV (Fig. 5) 
allowed mapping of one allele to linkage group 9 in a soybean 
restriction fragment length polymorphism map (A. Rafalski 
and S. Tingey, personal communication). 

DISCUSSION 

We have cloned a soybean cDNA encoding phytoene desat- 
urase, an enzyme of the carotenoid biosynthesis pathway. 
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Fig 5 Genomic blots and genetic mapping oipdsl . The enzymes 
indicated above the lanes were used to digest DNA from G soja 
(lanes 1-3) or G. max (lane 4). Solid and open circles indicate .G. max 
and G. soja pdsl alleles, respectively, detected with EcoRV. 

Results indicate that Pdsl is a nuclear-encoded chloroplast 
protein that catalyzes two successive desaturation reactions. 

The pdsl cDNA was isolated from a cDNA library pre- 
pared from immature soybean cotyledons. The low number 
of positive clones (2 in 5 x 10 5 clones) suggest that this is a 
low-abundance transcript. Indeed, we failed to detect the 
ndsl transcript in Northern blots containing up to 30 p% ot 
total soybean RNA or 2 jig of poly(A) + RNA (data not 
shown). Although at present we do not know whether the 
cDNA isolated is a full-length clone, the complementation 
and chloroplast import results indicate that all of the infor- 
mation necessary for enzymatic activity and plastid targeting 
is present in the cDNA and suggest that the N terminus of the 
cloned protein is most likely intact. 

Upon import into chloroplasts, mature Pdsl was detected 
in the soluble and membrane fractions. Carotenoids are 
membrane pigments and, based on immunological (3) and 
biochemical (32, 34) analyses, phytoene desaturase and all 
subsequent enzymes in the pathway are also membrane- 
bound Thus, it seems logical to assume that the enzymati- 
cally active form of Pdsl is also associated with membranes. 
This raises questions as to the origin of the soluble Pdsl 
protein noted above. It is possible that this form is a non- 
physiological artifact that accumulates only during the arti- 
ficial conditions used during in vitro chloroplast import. 
Alternatively, soluble Pdsl may be an intermediate in the 
membrane insertion process or it may play a nonenzymatic 
role in either the chloroplast stroma or envelope intermem- 

bl We haveiiot yet defined the chloroplast import processing 
site From a number of experiments similar to that shown in 
pig 3 it can be estimated that upon import the precursor 
polypeptide is shortened by 5-7 kDa. However, estimates 
obtained from denaturing gels are frequently associated with 
rather large errors. For microbial carotenoid desaturases, the 
dinucleotide-binding fold is located 4-9 amino acid residues 
downstream from the N terminus (4, 5, 24). Thus, if the 
situation with Pdsl is similar, the mature form would have 
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=478 residues and a deduced molecular mass of 54 kDa. This 
would be in the same range as N. crassa and Rb. capsulatus 
phytoene desaturases (24) and is in general agreement with 
the observed mobility of mature Pdsl in SDS/PAGE gels 
(Fig. 3). This line of reasoning suggests that the precursor of 
Pdsl possesses an N-terminal transit peptide of =9.8 kDa. 

We identified several amino acid residues that are con- 
served among Pdsl, Crtl, CrtD, and AM. Among these 
conserved residues, there is an N-terminal region with the 
characteristic motif of a dinucleotide binding fold (Fig. 1). 
However, several amino acid residues that are conserved in 
all microbial desaturases analyzed are not conserved in Pdsl. 
Notably absent in Pdsl is the C-terminal region containing 
the His-Pro dipeptide, which is proposed to be equivalent to 
the His-Pro active site motif of flavoprotein oxidoreductases 
(24). Thus, the presence of this dipeptide motif is not nec- 
essary for the enzymatic conversion of phytoene into £-car- 
otene. 

Expression in an Rb. capsulatus carotenoid mutant of the 
portion of pdsl coding for the mature protein resulted in the 
synthesis of ^-carotene. The most likely explanation for this 
result is that the function of Pdsl is to catalyze the two 
desaturations needed to convert phytoene into f-carotene. 
Alternatively, the specificity of Pdsl may be changed in 
bacteria. However, both genetic and biochemical evidence 
indicates that plants have a desaturase that converts phy- 
toene into ^-carotene. Mutant plants of several species 
blocked in carotenoid desaturations accumulate either phy- 
toene or f-carotene, suggesting that two distinct genes code 
for two desaturases (for review, see ref. 1). Beyer et al. (34) 
showed that the conversion of phytoene into 15-c/5-f- 
carotene and the subsequent conversion of this pigment into 
lycopene are two distinct enzymatic reactions. Hence, it 
appears that plants have two carotenoid desaturases and the 
simplest explanation for our results is that we have cloned a 
cDNA for the first of the two enzymes. 
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reading of the manuscript). We also thank P. Beyer (Freiburg) for his 
assistance in the identification of ci s-£-carotene. 
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Rice seed ADP-glucose pyrophosphorylase cDNA 
clones were isolated by screening a X expression library 
prepared from rice endosperm poly(A + ) RNA with a 
heterologous antibody raised against the spinach leaf 
enzyme and subsequently by nucleic acid hybridiza- 
tion. One cDNA plasmid, possessing about 1650 nucle- 
otides, was shown by both DNA and RNA sequence 
analysis to contain the complete ADP-glucose pyro- 
phosphorylase coding sequence of 483 amino acids. 
The primary sequence displayed a putative leader pep- 
tide presumably required for transport of this nuclear 
encoded protein into the amyloplasts, a differentiated 
starch containing plastid. The leader peptide, however, 
showed little sequence homology with transit peptides 
displayed by other known nuclear encoded proteins 
localized in the chloroplasts. A comparison of the pri- 
mary sequence of the putative mature subunit to the 
Escherichia coli pyrophosphorylase showed two re- 
gions displaying significant homology. These two con- 
served regions contain residues shown previously to be 
essential for the allosteric regulation and catalytic ac- 
tivity of the E. coli enzyme. Differences in the primary 
sequences of the plant and bacterial enzyme may re- 
flect the distinct nature of the allosteric effectors that 
control these enzymes. 



ADP-glucose pyrophosphorylase plays a pivotal role in the 
control of synthesis of starch in plants (Preiss et al, 1967; 
Preiss, 1988) and of glycogen (Preiss, 1984) in bacteria. This 
enzyme catalyzes the synthesis of the activated glucosyl do- 
nor, ADP-glucose, from Glc-l-P and ATP. The catalytic 
activity of both the plant and bacterial ADP-glucose pyro- 
phosphorylases is strongly affected by small effector mole- 
cules. Although the nature of these effector molecules differ 
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among bacterial and plant enzymes, they typically are major 
metabolites which accumulate during normal carbon metab- 
olism in these organisms (Preiss, 1978). The Escherichia coli 
enzyme is activated by Fru-1,6-P 2 and inhibited by ADP and 
AMP, whereas the plant enzyme is stimulated and suppressed 
by 3-PGA 1 and orthophosphate, respectively (Preiss, 1978). 
Results obtained from both intact leaf and chloroplast sys- 
tems support the concept that starch biosynthesis is con- 
trolled by the ratio of 3-PGA to Pj via the allosteric control 
of ADP-glucose pyrophosphorylase in vivo (Heldt et al, 1977). 
Allosteric regulation of ADP-glucose pyrophosphorylase from 
maize endosperm (Plaxton and Preiss, 1987) and potato tuber 
(Sowokinos and Preiss, 1982) has also been demonstrated in 
vitro, although it is not clear whether the levels of these 
effector molecules oscillate in these developing nonphotosyn- 
thetic tissues, as detected in chloroplasts of leaf tissue. Never- 
theless, the activity of the above two nonphotosynthetic ADP- 
glucose pyrophosphorylases is highly dependent on the pres- 
ence of the allosteric activator, 3-phosphoglycerate. 

ADP-glucose pyrophosphorylases from both E. coli and 
Salmonella typhimurium are homotetramers with subunit mo- 
lecular weights of about 49,000 (Preiss, 1984). Unlike the 
bacterial enzymes, the plant ADP-glucose pyrophosphorylase 
exhibits more structural complexity. The homogeneous spin- 
ach leaf enzyme was observed to have a native molecular 
weight of 206,000 (Copeland and Preiss, 1981) and composed 
of dissimilar subunits of 54,000 and 51,000 by sodium dodecyl 
sulfate-polyacrylamide gel electrophoresis (Copeland and 
Preiss, 1981; Morell et al, 1987a). These subunits can be 
distinguished on the basis of their N-terminal sequences, 
tryptic peptide patterns, and immunological reactions (Morell 
et al, 1987a). Immunological related subunits of similar sizes 
have also been observed in leaf tissue of Arabidopsis thaliana 
(Lin et al, 1988a, 1988b), wheat, rice, and maize (Krishnan 
et al, 1986). The isolation of two nuclear mutations that 
affect the level of one or both subunits in Arabidopsis (Lin et 
al, 1988a, 1988b) supports the postulate that the plant enzyme 
is a product of two different genes. 

Functional studies with pyridoxal phosphate, an analog of 
the allosteric activator 3-PGA, suggest a role for both subunits 
in enzyme function (Morell et al, 1987b, 1988). To learn more 
about the structure of the plant enzyme we have cloned the 
complete coding sequence of an ADP-glucose pyrophosphor- 
ylase transcript from rice endosperm. Structural analysis has 
shown that this plant subunit shares significant homology at 
the amino acid level with the E. coli polypeptide. Despite the 
relative conservation of sequences near the N terminus, shown 



1 The abbreviations used are: 3-PGA, 3-phosphoglycerate; ORF, 
open reading frame; bp, base pair(s). 
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earlier to be essential for allosteric regulation of the bacterial 
enzyme (Larsen et ai, 1986; Parsons and Preiss, 1978), resi- 
dues essential for catalytic control of the plant enzyme are 
located near the C terminus (Morell et a/., 1987b, 1988). As 
discussed in more detail herein, we suggest that the differences 
in the allosteric specificity exhibited by the bacterial and 
plant enzyme may be accounted for, at least in part, by 
structural differences mediated by specific segmental and 
point mutations. 

MATERIALS AND METHODS 

Plant Material— Rice seeds (Oryza sativa L. cv. Biggs M201) were 
germinated and grown in an environment-controlled growth chamber 
as described earlier (Krishnan et al, 1986). Seeds were collected at 
12-15 days after flowering and stored at —80 °C. 
cDNA Library Construction and Screening — All labware and solu- 
- tions, where appropriate, were sterilized by baking at 180 °C for at 
least 2 h or by autoclaving to eliminate ribonucleases. Total RNA 
was isolated from 12-15 days after flowering seeds as described 
previously (Krishnan et ai, 1986). Polyadenylated (poly(A + )) RNA 
was selected by oligo(dT) -cellulose chromatography as described by 
Maniatis et ai (1982). A cDNA library was prepared in X gtll and 
screened with either antiserum, prepared against spinach leaf holo- 
enzyme of ADP-glucose pyrophosphorylase (Krishnan et al. y 1986), 
or 32 P-labeled AGP-1 cDNA obtained by nick translation (Maniatis 
et ai, 1982). 

DNA Sequencing and Primer Extension — The cDNA clones were 
subcloned into M13-mpl8, and sequential deletions were prepared 
using exonuclease III as described by Henikoff (1984). Single- 
stranded DNAs from an ordered set of deletions were isolated and 
sequenced by the Sanger dideoxy chain termination method (Sanger 
et ai, 1977). Sequence analyses were performed with the UWGCC 
program (Devereux et ai, 1984). Direct RNA sequencing was carried 
out by reverse transcribing poly(A + ) RNA in the presence of a 16- 
mer synthetic oligonucleotide complimentary to the region 20 bases 
downstream from the 5 '-end of the RSc6 cDNA clone. The primer 
extension reaction was performed essentially as described by Inoue 
and Cech (1985). 

RESULTS AND DISCUSSION 

Cloning of ADP-glucose Pyrophosphorylase mRNA — In a 
previous study, a cDNA clone, AGP-1, was isolated by screen- 
ing a X gtll library with antibodies raised against the spinach 
leaf ADP-glucose pyrophosphorylase (Krishnan et ai, 1986). 
The structure of this cDNA clone was determined by DNA 
sequence analysis to be a cloning artifact. Although the 5'- 
end segment of the DNA insert contained an open reading 
frame (ORF) of 1035 bp for ADP-glucose pyrophosphorylase, 
it displayed an unusually long 3 '-segment of over 670 bp and 
lacked a poly(A) tail (Fig. 1). As an additional 500-600 nucle- 
otides would be required to complete the remaining coding 
sequence and poly(A) tract, the total length of this cDNA 
would greatly exceed the estimated transcript size of 1800 
nucleotides determined by Northern blot analysis (Krishnan 
et ai, 1986). The 3'-noncoding sequence of this clone was 
observed to be unreactive to rice seed poly(A + ) RNA or DNA 
indicating that it was a spurious contaminant isolated during 
the cloning procedure. To obtain an authentic recombinant 
clone, a second library was constructed in X gtll and screened 
by nucleic acid hybridization using the AGP-1 insert as a 
probe. Fourteen clones were isolated. One of these clones, 
pRSc6, contained a cDNA insert of 1650 nucleotides and was 
chosen for further study. 

Sequence and Amino Acid Analysis — The 1650-bp cDNA 
clone RSc6 was sequenced on both strands by dideoxyribo- 
nucleotide chain termination reactions using overlapping de- 
leted clones as templates (Fig. 1). An ORF of 483 residues 
was found which begins with an ATG at bp position 1 and 
ends at the termination codon TAA at bp position 1460 (Fig. 
2), This ORF of 483 amino acids predicts a protein with a 
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Fig. 1, Physical map of restriction enzyme sites for endo- 
sperm-specific ADP-glucose pyrophosphorylase cDNAs. The 

cDNAs were purified from the X gtll cloning vector and inserted into 
Bluescript plasmid DNA. The resulting plasmid DNA was digested 
with restriction enzymes that digested the cloning vector only once. 
AGP-1 contained DNA sequences that encoded a truncated subunit 
possessing 341 residues from the C terminus. The 3 '-end of AGP-1 
contained 28 nucleotides of the 3 '-untranslated region and an addi- 
tional 646 bp of non -pyrophosphorylase sequence inadvertently 
cloned during the construction of the cDNA library. A second cDNA 
clone, RSc6, was isolated from a second cDNA library using 32 P- 
labeled AGP-1 DNA as a probe. Open rectangles represent the coding 
sequences for ADP-glucose pyrophosphorylase. 

molecular weight of approximately 52,000 consistent with the 
size of the polypeptide of 56,000 estimated by sodium dodecyl 
sulfate-polyacrylamide gel electrophoresis of in intro-trans- 
lated product (Krishnan et a/., 1986). The termination codon 
is followed by a 3 '-untranslated region of 201 bp and a poly(A) 
tail. The consensus polyadenylation signal, AAUAAA, typical 
of most nonhistone eukaryotic mRNAs (Proudfoot and 
Brownlee, 1976), is present but is 117 bp upstream from the 
polyadenylation addition site. A variant form, AAUGAU, is 
observed within the usual 10-30 bp of the poly(A) site and 
may serve as the signal. 

As the cDNA clone began with an ATG, it was not clear 
whether these nucleotides represented the initiation codon or 
an internal methionine residue. Therefore, RNA sequencing 
studies were initiated. A synthetic 16-base oligonucleotide, 
complementary to nucleotides 20-35 of the messenger RNA 
transcript, was prepared, annealed to poly (A + ) -RNA, and 
subjected to chain termination reactions using reverse tran- 
scriptase. An additional 27 nucleotides at the 5 '-end were 
obtained by this method and revealed two nonsense codons. 
One of these termination codons lies 15 bases upstream of the 
putative ATG start and is in -frame with the coding sequence 
(Fig. 2). The presence of a nonsense codon upstream of the 
putative translational start indicates that the ORF present in 
pRSc6 is representative of the complete ADP-glucose pyro- 
phosphorylase coding sequence. The initiator codon is located 
in the sequence CAACCATCA which is consistent with the 
consensus translational start sites of both animal (Kozak, 
1984) and plant (Lutcke et aL, 1987) transcripts. 

ADP-glucose pyrophosphorylase is located exclusively in 
the chloroplasts of leaves (Okita et ai, 1979) and the amylo- 
plast in developing endosperm and potato tuber tissue. 2 As 
this enzyme is encoded by nuclear genes, it was expected that 
the protein would possess a leader sequence required for 
plastid localization. The N-terminal sequence of the mature 
subunit has not been elucidated, and, therefore, the exact 
processing site cannot be specifically assigned. However, 
alignment of the rice primary sequence to the N-terminal 
sequence of the purified spinach leaf 51,000 pyrophosphoryl- 
ase subunit obtained by Edman degradation (Morell et ai, 
1987b, 1988) indicates partial homology and suggests that the 
proteolytic cleavage site occurs between residues 28 and 29 

2 W. T. Kim, V. R. Francheschi, T. W. Okita, N. L. Robinson, M. 
Morell. and J. Preiss, manuscript in preparation. 
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IGEGCVIKNCKIHHSVVGLRSCIS 
GAAGGCGCAATAATAGAGGACTCATTACTCATGGGAGCTGACT 

egaiiedsllmgadyyeteadkkl; 

CTTGGTGAAAAAGGTGGCATTCCC^TTGGTATTGGGAAGAATTGCC^ 
LGEKGGIPIGIGKNCHIRRAI ID K ™ 

AATGCTCGTATTGGAGATAATGTGAAGA 
NARIGDNVKI INVDNVQEAARETD* 

GGATACTTCATCAAAAGTGGCATTGTTACCCT 
GYFIKSGIVTVIKDALLLAEQLYE 

GTAGCTGCGTMTATATGATGGGGCATCGGCGACGAGCACCAGGCGG 

V A A * 1^94 
TCAAGGCTCTTTATTCCCTTTTTTT w 7 

GCAACTGCTTTTCAGATGm 
Fig 2 DNA and deduced primary sequence of RSc6. The DNA sequence was obtained by analysis of 
recombinant clones containing overlapping segments of DNA as described in the text. Numbers on the right are 
in bp or amino acid residues relative to the translational start. The 5 ' -untranslated sequences were obtained by 
RNA sequencing of rice seed poly( A + ) RNA as described in the text. The translation initiation and polyadenylation 
signals are underlined. 
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Fig. 3. Alignment of the N- terminal regions of the encoded rice polypeptide to the mature spinach 
leaf M 51 000 pyrophosphorylase subunit determined by Edman degradation (Morell et a/., 1987a). 

Residue's conserved between the rice and spinach sequences are enclosed in boxes. Gaps were introduced to 
maximize the homology. The arrow designates the putative proteolytic cleavage site for the rice-encoded polypep- 
tide. The hydrophobic N terminus and charged region enriched in basic residues of the rice pyrophosphorylase 
leader sequence are overlined. 

(Fie 3) The putative leader sequence of ADP-glucose pyre- possessing physical properties similar to those observed in 
phosphorylase, however, shares very little sequence conser- other chloroplast transit peptides. The N terminus possesses 
vation particularly with the three blocks of homology sug- a 6-residue hydrophobic domain and a highly basic region 
gested by Karlin-Neumann and Tobin (1986) to be necessary near the C-terminal end of the leader peptide (Fig. 3) and 
for chloroplast transport and processing. The pyrophosphor- these features may serve as the signals necessary for plastid 
ylase leader sequence does contain short peptide domains targeting and processing of this plant protein. 
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Fig. 4. Conserved primary sequences displayed between the rice endosperm {R) and E. coli (E) 
ADP-glucose pyrophosphorylases. Residues conserved among these two proteins are enclosed in boxes. Gaps, 
indicated by dashes, were introduced to maintain homology. Numbers labeled on the right side of the figure indicate 
the number of residues from the translational start of either the rice or bacterial subunit. Lys 39 , Tyr 114 , Lys 195 , 
Lys 296 , and Gly 336 residues essential for allosteric and/or catalytic function of the bacterial enzyme are underlined. 
The conserved rice sequence at residues 462-472 homologous to the pyridoxylated peptide and which presumably 
lies in the allosteric region of the spinach leaf M t 51,000 subunit is overlined. 
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Fig. 5. Schematic representation of the primary structures 
of the E. coli and rice endosperm ADP-glucose pyrophosphor- 
ylases. The open boxes represent nonconserved regions while the 
striped boxes indicate peptide regions of significant similarity between 
the plant and bacterial subunits. The black box of the rice endosperm 
protein depicts the putative transit leader peptide. Residues essential 
for allosteric control or substrate binding of the bacterial enzyme and 
conserved in the plant enzyme are shown. Mutations in the bacterial 
gene at Lys 296 and Gly 336 , which are substituted by Glu and Asp, 
respectively, result in an altered ADP-glucose pyrophosphorylase 
with respect to allosteric properties (Lee et ai y 1987). Lys 469 of the 
rice subunit is the residue reactive to pyridoxal phosphate which 
mimics the allosteric effector, 3-PGA. 

The amino acid sequence derived from the rice cDNA clone 
sequence was compared with that obtained for the E. coli 
enzyme (Baecker et al. t 1983) (Fig. 4). This comparison 
showed there was an overall similarity of about 30% at the 
amino acid level between the plant and bacterial subunits, 
providing direct evidence for a common origin of these genes. 
Although significant divergence at the amino acid level is 
readily evident between these two enzymes, such changes 
occurred nonrandomly. When peptides delineated by residues 
48-227 and 297-394 are aligned the degree of similarity in- 



creased to 42%. Nonconserved regions of the plant and bac- 
terial subunits were evident at the N termini encompassing 
residues 29-47, a central domain between residues 228 and 
296, and at the C termini beginning with residue 395. The 
significant conservation displayed in the peptide domain 
spanning amino acids 48-227 is particularly noteworthy as 
specific residues have been identified that are essential for 
allosteric regulation and substrate binding of the bacterial 
enzyme (Parsons and Preiss, 1978). Lysine residues at posi- 
tions 39 and 195 of the E. coli primary sequence have been 
shown to be protected from phosphopyridoxylation by the 
activator, Fru-1,6-P 2 , and the substrate ADP-glucose 
(Baecker et ai t 1983; Parsons and Preiss, 1978), suggesting 
that these 2 residues lie at or near the allosteric and substrate 
binding sites. Likewise, the tyrosine residue at position 114 
in the E. coli enzyme when altered by site-directed mutagen- 
esis to phenylalanine causes lower affinity for its activator, 
fructose 1,6-P 2 , and substrates, ATP and ADP-glucose, and 
altered a lower apparent binding for its inhibitor, AMP (Ku- 
mar et aij 1988; Larsen et aL, 1986). The conservation of 
other amino acids in this region in the plant enzyme is 
strongly suggestive that these amino acids may also contribute 
a major role in maintenance of protein conformation and/or 
the regulatory and catalytic functions of the plant enzyme 
with 3-PGA as the major activator and P; the inhibitor. 

As the bacterial and plant enzymes catalyze the same 
reaction the structural differences exhibited by the plant and 
bacterial enzymes most likely reflect the nature of the effector 
molecules that mediates allosteric regulation of each enzyme. 
Other than the low homology exhibited between residues 228 
and 296, major structural changes are evident at the C termini 
of the primary sequences between the plant and bacterial 
enzymes (Figs. 4 and 5). Morell et aL (1987b, 1988) have 
shown that pyridoxal phosphate, which mimics the activator 
3-PGA, reacts upon reduction to a specific lysine residue of 
the spinach leaf 51,000 subunit. The reactive residue, however. 
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was not the conserved Lys 68 whose counterpart, Lys 39 , lies 
within the allosteric region of the E. coli subunit but within 
the sequence Ser-Gly-Ile-Val-Thr-Val-Ile-Lys-Asp-Ala-Leu. 
This peptide, which is observed in the rice endosperm subunit, 
is located at the nonconserved C terminus of the plant enzyme 
(Figs. 2 and 4) suggesting that at least a portion of the primary 
structure responsible for regulation of the plant enzyme lies 
on an extended peptide not evident in the bacterial subunit 
In addition the change in allosteric specificity of the plant 
enzyme is most likely due to amino acid replacements in other 
regions of the plant enzyme. Lee et ai (1987) and Kumar et 
al (1989) have shown that point mutations of the bacterial 
gene where Lys 296 and Gly 336 are substituted by glutamic and 
aspartic acid residues, respectively, result in the formation of 
a bacterial enzyme whose activity is less dependent on allo- 
steric activation by Fru-1,6-P 2 . Interestingly, residues of the 
plant subunit which directly align with Lys 296 and Gly 336 of 
the bacterial subunit are negatively charged amino acids (Figs. 
4 and 5). Point mutations to acidic residues at positions 329 
and 363 and addition of a C-terminal peptide to the plant 
enzyme may have simultaneously resulted in the loss of allo- 
steric specificity to Fru-1,6-P2 but a gain in sensitivity to 3- 
PGA. Whether these point mutations and structural differ- 
ences are a manifestation of the nature of the effector mole- 
cules remains to be explored. The allosteric and catalytic 
behavior of a recombinant enzyme constructed by the trans- 
location of the plant C-terminal peptide to the bacterial 
allosteric mutant enzyme may provide such information. 
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VIII. Argument 

A. REJECTION OF CLAIMS OVER THE GOLDMAN ET AL. '073 PATENT 
SUMMARY OF THE REJECTION 

Each of the pending claims has been finally rejected on the 
basis of both obviousness and anticipation over the Goldman et al . 
'073 patent. The position taken in the final Action is that (1) 
Goldman is at least enabling for what it claims (a process for 
introducing DNA into corn, without any representation that the 
process enables the preparation of fertile corn); and (2) that 
Goldman positively shows that "hygromycin resistance" and "EPSP 
synthase" activity had been achieved using AgroJbacteriujn-mediated 
transformation . 

In a rejection being considered together with the foregoing, 
the Action posits that Appellants' specification admits that the 
particular genes of the pending claims are known in the prior art. 
The Action reasons that at least certain of these references 
disclose the introduction of the gene(s) into other plant species, 
and this provides the requisite motivation for introduction of 
these genes into corn. 

APPELLANTS' LEGAL BASIS AND SUMMARY OF ARGUMENT 
With respect to the anticipation aspect of the rejection, 
Appellants note that none of the presently pending claims recites 
subject matter that is in any way taught or disclosed in Goldman. 
The independent claims currently on appeal are directed to fertile, 
transgenic maize plants that have introduced one of more transgenes 
selected from a Jbar gene, a nitrilase gene, a j3-lactamase gene, an 



xylE gene, a galactosidase gene, a tyrosinase gene, or a wheat germ 
agglutinin gene. Appellants have been unable to identify any 
disclosure in Goldman related to any of these particular individual 
genes. Thus, it is respectfully submitted that any concern 
regarding anticipation is misplaced. 

With respect to the obviousness concerns, in order to maintain 
the rejection, the Examiner must demonstrate that a case of 
obviousness of the appealed claims is established by the teachings 
of the '073 patent specification considered in combination with the 
secondary references. In re Deuel, 34 U.S.P.Q.2d 1210, 1214 (Fed. 
Cir. 1995) . The claims of the present appeal require fertile, 
transgenic corn plants bearing one or more of the specified genes. 
To be relevant to patentability, the '073 patent, alone or in 
combination with secondary teachings, must provide: 

(1) detailed enabling methodology for reproducibly preparing 
fertile, transgenic corn bearing the individual genes 
specified in the claims; 

(2) a suggestion to introduce the particular genes into corn; 
and 

(3) evidence suggesting that the particular genes would 
function appropriately upon successful introduction into 
corn, and that functional genes would be transmitted 
progeny plants. 

O'Farrell , 7 U.S.P.Q.2d at 1680. Goldman, alone or in combination 
with the secondary references, fails to satisfy the foregoing 
requisites of O'Farrell . 



With respect to enablement,- the art must do more than simply 

teach a means of introducing DNA into corn, it must teach a 

reproducible and reliable method of obtaining fertile, transgenic 

corn bearing the particular genes specified in the claims. As 

noted early on by the Supreme Court in the case of Seymour v. 

Osborne, 78 U.S. 516, 20 L. Ed. 33 (1871): 

Mere vague and general representations will not support [the 
invalidity of a later invention] , as the publication must be 
sufficient to enable those skilled in the art or science to 
understand the nature and operation of the invention, and to 
carry it into practical use. 

78 U.S. at 555. 

The evidence of record demonstrates that the method for 

introducing DNA into corn as set forth in the '073 patent was 

incapable of enabling the production of fertile, transgenic corn at 

the time of its priority date, 1987 or 1986. The ' 073 

specification was found to be unreliable for this purpose by both 

the PTO and an inventor of the '073 patent. As noted by Judge 

Learned Hand, no doctrine is better established in patent law than 

that in order for a prior patent to invalidate a later claim, the 

prior patent must bear adequate directions within its four corners, 

its results must be reproducible, and it must be "more than a 

starting point for further experiments": 

No doctrine of the patent law is better established than that 
a prior patent or other publication to be an anticipation must 
bear within its four corners adequate directions for the 
practice of the patent invalidated. If the earlier disclosure 
offers no more than a starting point for further experiments, 
if its teaching will sometimes succeed and sometimes fail if 
it does not inform the art without more how to practice the 
new invention, it has not correspondingly enriched the store 
of common knowledge and it is not an anticipation. 



Dewey & Almy Chem. Co. v. Mimex Co., 124 F.2d 986, 980, 52 U.S.P.Q. 
138 (2d Cir. 1942). Thus, to be considered for the purposes of 
obviousness, a reference must either be fully enabling in and of 
itself, or be combined with a secondary reference whose teachings 
will remedy the inadequacies of the first reference. Symbol 
Technologies, Inc. v. Opticon, Inc., 19 U.S.P.Q.2d 1241 (Fed.* Cir. 
1991) . 

In order to demonstrate that a reference in non-enabling, 
Appellants are required to do so by only a preponderance of the 
evidence. In re Sasse, 207 U.S.P.Q. 107, 111 (CCPA 1980). Once an 
appellant demonstrates non-enablement by a preponderance of the 
evidence, the obviousness rejection cannot be maintained unless the 
Examiner comes forward with evidence negating the appellant's basis 
of the Goldman patent's non-enablement. 

In the present case, Appellants will demonstrate that a 
preponderance of the evidence compels the conclusion that the '073 
patent does not reproducibly and reliably enable the preparation of 
fertile, transgenic corn, for the reasons that: 

(1) the only evidence of gene transmission to a second 
generation presented by Goldman employed an improper 
assay that was not accepted by the PTO in the Goldman 
prosecution; 

(2) Goldman acquiesced in the earlier finding by the PTO that 
the '073 specification does not enable fertile, 
transgenic corn; 

(3) the PTO itself has argued successfully that Agrobacterium 



was not enabled for the preparation of transgenic 
monocots at a time well after the Goldman et al. priority 
date(s) in a published Federal Circuit opinion; 

(4) the particular Agrobacterium strains taught by Goldman 
are now known to be ineffective in the preparation of 
fertile, transgenic corn; and 

(5) one of the Goldman inventors, Ann Graves, confirms that 
the technique disclosed in the '073 specification is 
unpredictable and unreliable. 

The foregoing evidence provides a strong indication of 
inoperability . In response to these indications, the Examiner 
fails to counter with a scientifically reasonable explanation of 
why the art would expect the Goldman method to yield fertile, 
transgenic corn, much less other evidence of operability of 
Goldman's method. 

The second requirement of O'Farrell is that the Examiner must 
demonstrate a suggestion in the art to modify the principal 
reference to arrive at the subject matter of the claims. Here, the 
claims relate to fertile, transgenic corn plants that have been 
genetically engineered to introduce particular gene species. The 
Examiner must thus show the secondary references provide a 
suggestion to modify the Goldman teachings with respect to the 
individual specified genes. 

The only secondary references even generally identified by the 
Examiner are those articles cited in Appellants' specification, 
that merely detail the isolation and/or in vitro manipulation of 



the various genes that Appellants have introduced into transgenic 
corn. However, none of these articles in any way teaches or 
suggests introducing any of the recited genes into the corn genome. 
It is indeed Appellant's specification that teaches the 
introduction of the recited genes into corn and it is improper to 
use the Appellant's own teachings to support an obviousness 
rejection. Furthermore, the secondary art fails to provide any 
discussion relevant to how one would proceed to introduce the genes 
into corn, fails to provide any evidence that such an introduction 
would be successful if attempted, and fails to provide any evidence 
that the genes would function appropriately if introduced into 
corn. 

Lastly, O'Farrell requires that there be some evidence that 
the prior art technique would be successful in the preparation of 
transgenic corn bearing the particular cited genes, and that the 
genes would be expected to function appropriately in progeny. If 
the evidence is insufficient to establish that Goldman had actually 
achieved fertile, transgenic corn, there is certainly no evidence 
that the Goldman technique would be successful in the case of the 
particular genes of the present invention that are not even 
mentioned in Goldman. Furthermore, because of the unreliability of 
the opine assays used by Goldman to purportedly demonstrate 
successful introduction of a gene into corn, there is no evidence 
that the method disclosed in the Goldman specification would be 
successful in the case of any gene. 



THE GOLDMAN TECHNIQUE DOES NOT ENABLE THE 
PREPARATION OF TRANSGENIC MAIZE OR PROGENY 

Agrobacterium Mediated DMA Transfer in Dicotyledonous Plants 

Agrobacteri urn tumefaciens is a bacterium that is used 
extensively for the introduction of recombinant DNA into 
dicotyledonous plants (dicots) . Wounding of a dicot plant cell 
results in induction of a series of DNA procesing and transfer 
events in Agrobacterium that culminate in the introduction of DNA 
into the cell by means of the T-DNA (Transferred DNA) portion of 
the Ti plasmid. 1 The T-DNA integrates into the nuclear genome of 
the infected plant cell, and the plant cell in turn seeks to 
express the genes carried on the T-DNA, and indeed manufactures 
certain enzymes encoded by the T-DNA segment that elicits the 
synthesis of tumor-specific compounds called opines. Thus, 
introduction and expression of foreign genes into dicots is 
achieved through Agrobacterium infection by inserting the desired 
gene into the T-DNA region of the Ti plasmid, followed by 
appropriate wounding and infection of the dicot. Numerous factors 
affect the ability of any particular plant species to be 
susceptible to AgrroJbacteriujn-mediated gene transfer, particularly 
whether the species has an appropriate wound response mechanism, 
and how the particular strain of Agrobacterium responds to the 
wounded cell. 

1 A Ti plasmid is an extrachromosomal closed circular DNA that carries genes that will 
induce crown gall disease in plants, and is used as the DNA carrying vector in Agrobacterium- 
mediated plant transformation. 



The Agrobacterium-based Technique Disclosed by Goldman 

The Agrobacterium technique disclosed by Goldman is no 
different from the technique as it was traditionally practiced in 
dicots. Goldman suggests making a wound in a Gramineae seedling in 
an area containing rapidly dividing cells, and inoculating the 
wound with vir+ Agrobacterium tumefaciens containing a genetically 
engineered T-DNA (Exhibit B; col. 6, 1. 45-69). After the 
seedlings are inoculated with the vir+ A. tumefaciens , they are 
incubated until transformation takes place, at which time the 
seedlings are planted and allowed to grow until they produce 
pollen. (col. 7, 1. 55-60). 

Why Maize is Not An Appropriate Host for Agrobacterium 
Maize is not a natural host of Agrobacterium, because it lacks 
at least one key feature required for successful transformation by 
Agrobacterium. Cereals in general lack the wound response that is 
necessary for successful integrative Agrobacterium transformation. 
Instead, as explained by Potrykus (Exhibit D) , cereal cells die 
when wounded, whereas most dicots and some monocots respond to 
wounding by production of dedif f erentiation and cell proliferation 
in adjacent cells. Furthermore, most dicot cells are known to 
produce phenolic compounds following wounding, whereas these 
compounds are generally not produced by wounded cereal cells. 
These two features of cereal wound responses, i.e., lack of cell 
proliferation at the wound site and lack of production of phenolic 
compounds, are impediments to successful Agrobacterium 
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transformation. Binns (Exhibit E) explained that successful 

Agrobacterium transformation requires that the cells into which 

Agrobacterium T-DNA is transferred must be capable of cell division 

and monocots typically do not exhibit this response: 

Thus, the ability of wounded plant cells to enter and 
carry out one or more cell cycles may be absolutely 
required for successful [Agrobacterium] transformation . 
It is interesting to note that the wound response of most 
monocotyledons is quite different from that of 
dicotyledon: cells around the wound site lignify or 
sclerify in the absence of apparent cell division. 

(Exhibit E, p. 137, right column). Therefore, a wounded monocot 

cell is not competent for Agrobacterium transformation and the 

Agrobacterium infected cell is fated to die. 

Phenolic compounds produced by wounded plant cells are 

detected by Agrobacterium virulence (vir) genes, ultimately 

resulting in the induction of T-DNA processing and DNA delivery to 

the host cell. Usami et al. (Exhibit F) explained that whereas 

dicot plants produce diffusible compounds, e.g., phenolic 

compounds, that are capable of inducing vir gene expression, 

monocot plants, including corn, do not produce these compounds and 

therefore the process of T-DNA transfer to the host cell is never 

initiated following Agrobacterium infection of a wounded monocot 

cell. Usami et al. (Exhibit F) further explains that the process 

of T-DNA transfer can be initiated in a monocot following external 

application of a known vir gene inducing compound (e.g. 

acetyosyringone) . The '073 specification does not teach a method 

of inducing vir gene expression in Agrobacterium and, therefore, it 

is highly unlikely that T-DNA processing and delivery to the host 



cell would occur following the methods disclosed in the '073 
specif ication . 

Agroinf ection of Maize via Agrobacterium Does Not Lead to 
Stable Transformation and Integration 

While it is true that viral DNA, e.g., Maize Streak Virus 
(MSV) , can be delivered to maize via Agrobacterium tumefaciens in 
a process termed "agroinf ection" (Boulton et al . , Exhibit G) , 
successful virus infection is based upon the ability of the virus 
to move from cell to cell and spread to sites away from the wound 
inoculation site, i.e., away from those cells fated to die in 
response to wounding. If the DNA were integrated into the host 
genome of the wounded recipient cell as purported by Goldman, a 
successful viral infection would not occur because the virus would 
be unable to move to the healthy proliferating cells away from the 
wound inoculation site. 

Goldman Teaches the Wrong Strain of Agrobacterium 
The inoperability of the Goldman disclosure in the preparation 
of fertile, transgenic corn is further evidenced by the fact that 
Goldman discloses inoculation of maize seedlings with both octopine 
and nopaline synthezising strains of Agrobacterium and purport to 
achieve equal ef f iciecies of DNA delivery with both types of 
strains. However, even agroinf ection of maize with MSV using 
Agrobacterium as a vector is highly dependent upon the strain of 
bacteria used. In particular, Boulton et al . demonstrated that 
whereas it was possible to achieve agroinf ection of maize plants 
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with MSV using nopaline producing strains of Agrobacterium such as 
C58, it was not possible to do so using octopine producing strains 
such as Ach5 and others (see Table 1, Boulton et al . , Exhibit G) . 
Goldman, however, purports to produce fertile, transformed plants 
in Example XI using Agrobacterium strain C19, a derivative of 
strain Ach5! Thus, it is highly improbable that Goldman could have 
achieved even non- integrative -agroinf ection with Ach5-derived 
strains. 

The data in column 16 of the Goldman patent was also produced 
using an octopine producing strain of Agrobacterium which based on 
Agroinf ection data (Boulton et al . , Exhibit G) is not capable of 
delivering DNA to maize cells. The strain of Agrojbacteriujn which 
is capable of delivering DNA to the maize cell is not related to 
the type of DNA that is being delivered (e.g., viral or non-viral) , 
but is related to the process that Agrobacterium uses for 
delivering any DNA sequence to a plant cell. Therefore, a strain 
of Agrobacterium that does not deliver DNA to maize cells in the 
process of Agroinf ection, can hardly be expected to deliver DNA to 
maize cells using the process described by Goldman. 

The Assays Employed by Goldman Are Notoriously Unreliable and 
the Controls Employed Were Inappropriate 

Of particular importance is the fact that Goldman relies only 
upon the detection of octopine to verify that transformation has 
occurred. The assays employed by Goldman detect the conversion of 
arginine to an opine compound, and are notoriously unreliable as 
proof of the presence of T-DNA. Christou et al. (Exhibit H) 



demonstrated that normal callus and plant tissue of several species 
of plants, including maize (page 220) are capable of converting 
arginine to nopaline in the absence of T-DNA l Thus, the presence 
of enzyme activities that will convert arginine to opine compounds 
is to be expected in maize cells that do not contain T-DNA and 
hence there is no definitive evidence that the plants disclosed by 
Goldman are transformed at all. This was the basis of the PTO's § 
112(1) rejection in the Goldman prosecution. 

The Example in column 16 of the Goldman specification does not 
purport to disclose fertile transgenic maize plants and progeny, 
but at best purports to disclose transient expression of introduced 
DNA, i.e., lysopine dehydrogenase activity was assayed in 
inoculated seedlings only at 7-14 days after inoculation. 
Appellants submit that the proper controls were not done in these 
experiments. Goldman disclosed assays of seedlings that have been 
infected with strains of Agrrojbacteriujn that contain functional or 
non-functional vir genes. Both the vir+ and the vir- strains of 
Agrobacterium used by Goldman contain a functional lysopine 
dehydrogenase gene. The proper control for this experiment should 
have been a strain of Agrrojbacteriu/n completely lacking the Ti 
plasmid, including the lysopine dehydrogenase gene, or uninfected 
maize seedlings. 

Furthermore, the number of samples assayed in the experiment 
disclosed in column 16 of the Goldman patent is too small to 
generate data from which conclusions may be drawn. In Figure 3B of 
the Goldman patent (described in column 16) , five maize seedlings 

12 



inoculated with a vir+ strain of Agrobacterium were assayed and one 

seedling was observed to produce a compound that migrated similar 

to octopine. However, only four vir- inoculated seedings were 

assayed for lysopine dehydrogenase activity. This is simply not 

enough seedling assays to conclude that lysopine dehydrogenase 

activity is not present in vir- Agrobacterium infected maize 

seedlings. Goldman et al . did not even assay as many vir- 

AgrroJbacteriuin inoculated seedlings as they did vir+ Agrobacterium 

inoculated seedings! If only four vir+ AgrrojbacteriuTn inoculated 

maize seedlings had been assayed, it is possible that Goldman et 

al. would have concluded that vir+ Agrobacterium strains are not 

capable of transforming maize! 

The importance of sufficient numbers of proper negative 

controls is underscored by Christou et al . (Exhibit H) , who 

specifically addressed the unreliability of opine assays. Christou 

et al . concluded that, 

"Our results indicate that the presence of opine synthase 
activity in callus or plant tissue should only be 
considered a preliminary indication of transformation by 
Agrobacterium . Further biochemical analyses need to be 
carried out in order to establish whether transformation 
has taken place. It .is not clear if this endogenous 
synthase activity is physiologically important to the 
plant or if it is a laboratory artifact of some unknown 
enzyme that has another catalytic role. " (emphasis added, 
page 221, left column) . 

In his discussion Christou refers to all opine synthesizing 

enzymes including lysopine dehydrogenase and nopaline dehydrogenase 

as opine synthase activities, and states that these activities are 

not conclusive evidence of transformation. Goldman's only evidence 

for transformation of maize is the presence of opine synthesizing 
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activities, Goldman has no evidence beyond purported evidence of 
opine synthesizing activity in Agrobacterium infected maize tissues 
and certainly does not demonstrate the presence of introduced 
foreign DNA in maize or even the stability of expression of the 
purported opine synthesizing activities. 

There is No Teaching in Goldman that "Hygromycin Resistance 11 
or "EPSP Synthase" Expression Had Actually Been Attempted or 
Accomplished, and No Evidence of any Controls 

Nowhere does Goldman indicate that "hygromycin resistance" and 
"EPSP synthase" activity had been achieved. On the contrary, the 
excerpts from Goldman relied upon by the Examiner are clearly 
prophetic, and give no indication that any assays were ever 
actually carried out. 

The final Action states that Goldman teaches that hygromycin 
resistance "was obtained" and that proper controls were given, 
referring to column 21-22. This is submitted to be incorrect: The 
passage beginning at column 22, line 17, simply describes a 
hygromycin assay per se, written in the present tense rather than 
the past tense of the previous paragraph. There is no indication 
in this passage that negative or positive controls were or even 
should be carried out. 

The same can be said for Goldman's purported teaching of 
transgenic corn expressing "EPSP synthase" activity. The entire 
"Example X" of Goldman, beginning at col. 22, line 35, is written 
in the present tense, with no actual studies or results of studies 
being reported. This is true for the so-called "Assays" of same 
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reported beginning at col. 23, line 44 — As with the hygromycin 
"tests", no tests, no test results! 

DR. ANN GRAVES, ONE OF THE INVENTORS OF THE '073 PATENT, 
AGREES THAT THE TECHNIQUE IT DISCLOSES IS NOT RELIABLE OR 
PREDICTABLE 

Dr. Ann Graves, a named inventor of the '073 patent, has 
confirmed to the present Appellants that the technique disclosed in 
the '073 specification is unreliable and generally unpredictable. 
Dr. Christopher Flick, an employee of the instant assignee, 
contacted Dr. Graves to obtain her opinion about the invention 
described in the '073 specification. A declaration of Dr. Flick's 
setting forth the substance of his various conversations with Dr. 
Graves is attached hereto as Exhibit I, and further includes a 
detailing of Dr. Flick's discussions with a notable expert in the 
field of AgrroJbacteriujn-mediated plant transformation, Dr. Eugene 
Nester of the University of Washington. 

As detailed in paragraphs 5-10 of the Flick declaration, Dr. 
Graves has indicated that the Agrobacterium mediated transformation 
methods set forth in the '073 patent were unpredictable and 
unreliable in their results, and that the techniques were only 
successful on occasion. Dr. Graves suggested that this was because 
there was a narrow "window" of susceptibility of Graminae seedlings 
to Agrobacterium, and that there was no means known to her to 
pinpoint this "window" . She further indicated that without 
additional knowledge, Agrobacterium mediated transformation of 
Graminae, including maize, cannot be reliably carried out. 
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From the foregoing it can only be concluded that the '073 
patent is not enabling for the preparation of fertile, transgenic 
corn- Dr. Flick indicates that he has reviewed the '073 
specification and has confirmed that it fails to teach or suggest 
the problem pinpointed by Dr. Graves that must be overcome in order 
to provide an enabling method — that of identifying the requisite 
"window" for maize transformation with Agrobacterium. 

Dr. Flick's declaration is further strengthened by reference 
to statements made by Dr. Eugene Nester, a recognized expert in the 
transfer of DNA to plants by means of Agrobacterium . Dr. Nester 
indicated that transfer of DNA to monocots by Agrobacterium is 
highly dependent upon the correct selection of the Agrobacterium 
strain, and that octopine producing strains such as was taught by 
Goldman are incapable of delivering DNA to monocots such as corn. 
Furthermore, Dr. Nester indicates that at the time of Goldman's 
filing, there was no knowledge in existence that would have 
indicated to those of skill in the art which strains could be 
employed and under what specified conditions. Therefore, the 
Goldman specification is not only misleading, there were no 
teachings at that time to correct the deficiency. Lastly, Dr. 
Nester confirmed that opine assays such as those relied upon by 
Goldman are notoriously inaccurate in plant tissues, and concludes 
that the '073 specification discloses no credible evidence for 
transformation of maize or any other Graminae species. 
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THE "LAW OF THE CASE" MOST BE RESPECTED: THE PATENT OFFICE 
HAS ALREADY DETERMINED THAT GOLDMAN IS NOT ENABLING FOR THE 
PREPARATION OF FERTILE TRANSGENIC CORN — AND GOLDMAN 
ACQUIESCED IN THAT DETERMINATION 

Any allegation that Goldman is enabling for the preparation of 
fertile, transgenic corn is not supported by the record and is in 
direct contradiction to the PTO's position in the Goldman 
prosecution. The PTO consistently took the position on the Goldman 
record, that the Goldman specification is NOT enabling for the 
preparation of transgenic corn per se, and, at best, merely 
describes one technique for transferring T-DNA into corn. In this 
regard, Appellants direct the Board's attention to the file history 
of USSN 06/880,271, a now-abandoned parent of Goldman et al . , in 
which the Goldman applicants sought allowance of claims directed to 
transformed corn plants per se. 2 

In an Office Action dated 5/30/89 (see Exhibit J) , the Goldman 

examiner rejected a claim directed generally to transgenic corn 

plants under 35 U.S.C. § 112, first paragraph, stating that: 

The specification only demonstrates the expression of 
heterologous genes, namely opine synthases, in seedling tissue 
or plant parts directly derived from growing the transformed 
seedling. No demonstration of stable gene integration or 
sexual transmission of the exogenous gene, other than 
prophetic examples, is shown. Given the recalcitrance of 
monocots to Aqrobacterium transformation, as discussed by 
Applicants., and the possibility of transient opine synthase 
expression by non-integrative genes, as discussed by 
Hernalsteens et al . (see, e.g., page 3040, column 2, second 
paragraph) , undue experimentation would be required by one of 
ordinary skill in the art to obtain stable gene integration or 
sexual transmission of the exogenous gene as claimed. 



2 Appellants are not aware of any further pending applications in the Goldman series of 
applications. If the Examiner is aware of any such applications or patents, the Examiner is 
requested to make them of record. 
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In response, Goldman et al . failed to present any additional 
data per se on the record in support of a conclusion that stable 
integration had been achieved, merely pointing to new matter added 
in what is now Example XI of the '073 patent. However, this 
example merely relates to the purported demonstration of opine 
synthesis, already found by the PTO Examiner to be an insufficient 
test for stable gene integration. 

In the next Office Action (OA of 3/8/90; Exhibit K) , the 

rejection was maintained, with the examiner finding that stable 

gene integration had not been demonstrated: 

Applicants urge that rejection of the claims as being non- 
enabled for transformed plants obtained by sexual reproduction 
is improper, given the demonstration of opine synthase 
expression in cells directly derived from transformed 
meristematic cells . The Examiner maintains that opine 
synthase is an insufficient test for stable gene integration 
(see, e.g., Christou et al . which discloses opine production 
by untransformed cells) . (emphasis supplied) 

The application became abandoned as a result of Goldman's 
failure to respond to the 3/8/90 Office Action. This abandonment 
by Goldman of claims to transgenic plants is submitted to be an 
acquiescence, or at least a rebuttable presumption, that the 
specification is enabling only for the narrow process set forth in 
the '073 claims, and not enabling for the production of fertile, 
transgenic corn plants in general. 

In the case of Litton Systems Inc. v. Whirlpool Corporation, 
221 U.S.P.Q. 97 (Fed. Cir. 1984), the Federal Circuit held that 
failure to argue and appeal a final rejection on the basis of 
enablement under § 112, operated as an estoppel against the 
applicant to later assert that the original specification was 
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enabling. This principle has been adhered to in the more recent 
case of Waldemar Ltd., GmbH & Co. v. Osteonics Corp., 31 U.S.P.Q.2d 
1855 (Fed. Cir. 1994). That is precisely the situation here: The 
Goldman applicants attempted to claim transgenic plants, were 
finally rejected on the basis that there was insufficient evidence 
that transgenic plants had actually been obtained, and failed to 
appeal or continue to prosecute claims directed to transgenic 
plants. It is submitted that by analogy to the Whirlpool doctrine, 
it must be concluded that Goldman acquiesced in the holding of non- 
enablement, and it is inappropriate to now contravene the previous 
holding of the PTO and independently "resurrect" the subject matter 
of Goldman's acquiescence. 

No additional evidence in support of a conclusion that the PTO 
was wrong in its initial determination has been presented; there is 
no reasoned scientific explanation of record why we should now 
question the PTO's determination in the very issue at hand — 
whether Goldman is enabling for the preparation of transgenic maize 
plants. Absent additional evidence, under the "law of the case" 
doctrine, the previous finding of the PTO in the Goldman 
prosecution itself should be followed. This is not unlike the 
situation in Ex parte Edwards, 231 U.S.P.Q, 981 (PTO Bd. Pat. App. 
Interf. 1986), where the Board observed that it was bound by an 
earlier determination that the patent in question was entitled to 
a particular filing date. To contravene the "law of the case" 
doctrine, it is incumbent upon the Examiner to provide a cogent 
explanation as to why the earlier determination by the PTO in the 
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Goldman prosecution was incorrect and should not be followed. 

The Examiner's response to the foregoing position is that 
"each case is decided on its own merits." While Appellants agree 
with this proposition in general, it is unclear how this in any way 
supports a conclusion that Goldman is enabling for fertile, 
transgenic corn. Since the merits of Goldman is precisely what is 
at issue here, the fact that each case is decided on its own merits 
supports Appellants' position. The current "of record" PTO 
position is that Goldman is non-enabling for fertile, transgenic 
corn. This holding is thus effectively res judicata and binding 
absent additional evidence. 

THE POSITION THAT AGROBACTERHJM— MEDIATED TRANSFORMATION IS 
INEFFECTIVE IN CORN IS SUPPORTED BY THE CASELAW 

Appellants' position that Goldman is not enabling for the 
preparation of fertile, transgenic corn is supported by the recent 
decision by the Federal Circuit in In re Goodman, 29 U.S. P. Q. 2d 
2010 (Fed. Cir. 1993) . The Goodman case involved the issue of 
whether Agrojbacteriujn-mediated transformation is effective to 
transform monocots. The application in question in Goodman was 
filed at approximately the same time as that of Goldman (1985 v. 
1986) . The Federal Circuit, in confirming the decision of the 
Board of Appeals, held that Agrobacterium mediated transformation 
was ineffective to transform monocots such as corn, relying on 
numerous scientific sources for its determination. 

One of the principal pieces of art relied upon by the Federal 
Circuit in its determination was an article authored by Goodman in 



1987 — two years after the filing date of the Goodman application 

and one year after Goldman's filing date — which underscored the 

fact that even at that late date there had been no successes with 

monocot transformation using Agrobacterium. The Federal Circuit 

also relied upon a review authored by Dr. Ingo Potrykus, in which 

he stated that "it has been widely considered that monocotyledonous 

plants, including the commercially important crop plants of the 

Gramineae family, are insensitive to [Ti plasmid transformation] 

and thus are not candidates for use of this gene transfer system." 

29 U.S.P.Q. at 2014. 

The Federal Circuit summed up its finding that at as late as 

1987, Agrrojbacteriujn-mediated monocot transformation was 

"encumbered" with "great uncertainties," stating: 

Thus, even the references cited by Goodman to show enablement 
support the Board's position that great uncertainties 
encumbered Agrojbacteriura-mediated transformation in monocot 
plants at the time of filing. Goodman's 1987 article shows 
that the claimed invention did not overcome those 
uncertainties . . . Thus, on Goodman's 1985 filing date, the 
record shows no reliable gene transformation method for use 
with monocot plants. Each of the methods for monocot plants 
was fraught with unpredictability. The teachings in the 
specification do not cure this unpredictability. The record 
shows that practicing a gene transformation method for all 
monocot plants, if possible at all in 1985, would have 
required extensive experimentation that would preclude 
patentability. 

29 U.S.P.Q. 2014-2015. 

GOLDMAN MOST BE HELD TO THE SAME STANDARD OF PROOF OF 
ENABLEMENT AS HAVE THE CLAIMS OF THE PRESENT CASE 

The claims of the present case have been subject to an 

exceedingly stringent standard of proof of enablement, with the 
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Examiner continually taking the position that only those transgenic 
plants bearing transgenes actually shown to function through actual 
experimental results are enabled. In the present prosecution, only 
following the demonstration of the successful introduction and 
expression of many, many different transgenes in corn has the 
Examiner agreed that Appellants' gene species have been enabled. 

Appellants do not agree with the Examiner's stringent approach 
in this regard, and are of the opinion that once a reasonable 
number of species have been demonstrated, that Appellants should be 
entitled to a generic claim. Nevertheless, as noted above, prior 
art is subject to the same enablement standard as is an applicant's 
specification. See, Dewey & Almy Chem. Co. v. Mimex Co. and Seymour 
v. Osborne, supra. It is therefore submitted that in the context 
of the present prosecution, and the present prosecution only, the 
Examiner must apply the same rigorous standard to Goldman as has 
been applied against Appellants. The only "transgene" expression 
purportedly demonstrated by Goldman was lysopine dehydrogenase. 

THE CASELAW SUPPORTS A CONCLUSION THAT THERE IS NO MOTIVATION 
TO INTRODUCE THE PARTICULAR GENES SPECIFIED IN THE APPEALED 
CLAIMS INTO MAIZE 

In order to find the appealed claims obvious, the Examiner 
must demonstrate that the prior art provides a reproducible method 
of preparing transgenic maize, that the individual genes specified 
in the claims are enabled, and that there is sufficient motivation 
for one of skill in the art to introduce the specified genes. It 
is Appellants' position that even if the Goldman technique is found 



to be broadly enabling for the preparation of transgenic offspring 
corn, it nevertheless fails to obviate the particular genes 
specified in the claims absent a motivation to introduce the 
particular gene into corn, and evidence that these genes would 
function appropriately upon introduction into corn. In re Vaeck, 
20 U.S.P.Q. 1438 (Fed. Cir. 1991). 

The claimed combination of elements in each of the claims — 
which specify particular foreign genes introduced into the corn 
genome — is a novel combination. As such, it is incumbent upon 
the Examiner to identify individual teaching (s) that would suggest 
the desirability of making each -of the claimed combinations. As 
recently pointed out by the Federal Circuit, in assessing the 
patentability of a novel combination of otherwise old elements, 
" [t]he critical inquiry is whether there is something in the prior 
art as a whole to suggest the desirability, and thus the 
obviousness, of making the combination." In re Newell, 13 
U.S.P.Q. 2d 1248, 1250 (Fed. Cir. 1989). 

The requirement that examiner's identify such a motivation is 
a longstanding patent law doctrine, even in biological cases. In 
the well-known case of In re Bergel and Stock, the CCPA stated it 
thusly: 

The mere fact that it is- possible to find two isolated 
disclosures that might be combined in such a way to produce a 
new compound does hot necessarily render such production 
obvious unless the art also contains something to suggest the 
desirability of the proposed combination. 

130 U.S.P.Q. 206 (CCPA 1961). 
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Motivation to combine elements can not be inferred from prior 

art that enables one of the elements of the combination. This is 

the clear meaning of the relevant case law, such as In re Gordon, 

wherein the Federal Circuit stated: 

The mere fact that the prior art could be so modified would 
not have made the modification obvious unless the prior art 
suggested the desirability of the modification. 

221 U.S.P.Q. 1125 (Fed. Cir. 1984). 

This doctrine has been routinely embraced by the PTO Board of 
Appeals. For example, in Ex parte Kranz , the Board made it clear 
that examiner's must identify either an explicit motivation, or a 
"compelling motivation based upon sound scientific principles." 19 
U.S.P.Q. 2d 1216, 1218 (BPAI 1990). 

Here, the Examiner has merely taken the position that since 
the genes are known, and some have been placed into a particular 
plant other than maize, that the introduction of the gene into 
maize in somehow inherently motivated. It is indeed Appellants' 
specification that provides for the first time the motivation for 
introducing the recited genes into maize — and for this reason it 
is improper hindsight for the Examiner to rely upon Appellants' own 
specification to provide the necessary motivation. In re Deminski , 
230 U.S.P.Q. 313 (Fed. Cir. 1986) This is certainly not the 
"compelling motivation" based upon "sound scientific principles" 
referred to by the Kranz Board, and certainly an insufficient basis 
for making out a prima facie case of obviousness. 

Appellants will now turn to a consideration of each of the 
individual species of corn/transgene combinations set forth in the 



claims pending in the appeal. Appellants have attempted to collect 
those references referred to in the subject specification and 
apparently relied upon by the Examiner in completing his basis of 
rejection. Appellants will address each combination individually, 
and requests that patentability be considered separately. 

(1) fertile transgenic maize bearing a Jbar gene (claims 129, 
135-137) 

The gene encoding phosphinothricin acetyl transferase (the Jbar 
gene, e.g., Streptomyces hygroscopicus or S. viridochromogenes) 
inactivates the active ingredient of the herbicide bialophos, 
phosphinothricin. As reported in Appellants' specification at page 
14, lines 17-35, the Jbar gene has been successfully cloned and 
introduced into other plant species. At page 14, line 35, 
Appellants refer to the two articles of De Block et al . (see 
Exhibits L and M) as exemplary of articles demonstrating the 
successful introduction of a Jbar gene into plant species other than 
maize. In particular, the De Block et al . article disclose the 
introduction of a Jbar gene into tobacco, potato and tomato plants 
(Exhibit L) and Brassica species (Exhibit M) . None of the 
foregoing plants are monocots, and it is submitted that successful 
introduction into these plants would in no way suggest that is was 
possible to introduce the same gene into corn, or what would be the 
effect of the introduced gene(s) on the resultant corn plant. See, 
In re Goodman, 29 U.S.P.Q.2d 2010 (Fed. Cir. 1993). 

Appellants have reviewed each of the two De Block articles and 
have been unable to find any disclosure in either that would teach 



or suggest the introduction of a bar gene into maize. Thus, it is 
difficult to see how such a reference could provide a motivation 
for introducing the gene into corn. Moreover, even if the Board 
were to conclude that there was somehow an "implied" motivation, 
there appears to be ho disclosure in either article that would 
teach how one would proceed to introduce the Jbar gene into corn. 
The only technique disclosed in either reference is that of 
AgroJbacteriujn-mediated transformation, which for the reasons 
discussed above with respect to Goldman is submitted to be 
insufficient and inappropriate in the case of corn. 

It is further pointed out that neither of the two De Brock 
references teach that a Jbar gen"e can be successfully introduced 
into corn, and each fail to teach or suggest that a Jbar gene would 
function appropriately to impart herbicide resistance upon 
introduction into corn. 

(2) fertile transgenic maize bearing a nitrilase gene (claims 
130, 138) 

Appellants' specification refers to the nitrilase gene at page 
13, lines 2-3, and incorporates by reference the article of Stalker 
et al. (Exhibit N) . As can be seen, the Stalker et al . article 
relates simply to the cloning of the nitrilase bxn gene, and fails 
to in any way teach or suggest to idea of introducing the gene into 
the corn genome. Furthermore, the article fails to provide any 
discussion relevant to how the ordinarily skilled worker would 
proceed to introduce the gene into corn, fails to provide any 
evidence that such an introduction would be successful if 



attempted, and fails to provide any evidence that the bxn would 
function appropriately if introduced into corn. 

In fact, that only discussion in Stalker et al . relevant to 
plants can be found at the very end, where it is mentioned that 
recent studies have shown that the gene has been introduced into 
"transgenic plants," but the kind of plants are not identified. A 
later article by Stalker indicates that these experiments were 
carried out merely in tobacco, again using AgrroJbacteriu/n-mediated 
transformation (which is a routine task in dicots such as tobacco) . 

(3) fertile transgenic maize bearing a j3-lactamase gene 
(claims 131, 139) 

The jS-lactamase gene is disclosed in the specification at page 

13, lines 15-16, which refers to the article of Sutcliffe (Exhibit 

0) . The Sutcliffe article relates to the cloning of the ampicillin 

resistance gene in an £. coli host using the pBR322 vector. No 

disclosure from Sutcliffe relevant to the cloning of the ampicillin 

resistance gene in corn has been identified, and this article fails 

to in any way teach or suggest the idea of introducing the gene 

into the corn genome. Furthermore, the article fails to provide 

any discussion relevant to how one would proceed to introduce the 

gene into corn, fails to provide any evidence that such an 

introduction would be successful if attempted, and fails to provide 

any evidence that the gene would function appropriately if 

introduced into corn. 



(4) fertile transgenic maize bearing a xylE gene (claims 132, 
140) 

The xylE gene is disclosed in the specification at page 13, 
line 21, which refers to the article of Zukowski et al. (Exhibit 
P) . The Zukowski article relates to the cloning of the xylE gene 
in E. coli and B . subtilis hosts. No disclosure from Zukowski 
relevant to the introduction of the xylE gene into corn has been 
identified, and this article fails to in any way teach or suggest 
to idea of introducing the gene into the corn genome. Furthermore, 
the article fails to disclose or suggest how one would proceed to 
introduce the gene into corn, fails to provide any evidence that 
such an introduction would be successful if attempted, and fails to 
provide any evidence that the gene would function appropriately if 
introduced into corn. 

(5) fertile transgenic maize bearing a galactosidase gene 
(claims 133, 141) 

While the subject specification mentions the use of the 

galactosidase gene at page 13, -line 24, it does not refer to a 

specific reference. However, Appellants enclose the exemplary 

reference of Teeri et al. (Exhibit Q) which discloses the 

introduction of the galactosidase gene into transgenic tobacco. 

However, as with the previous references, Appellants have been 

unable to identify any teaching from Teeri relevant to the cloning 

of the galactosidase gene in corn, and this article fails to in any 

way teach or suggest to idea of introducing the gene into the corn 

28 



genome. Furthermore, the article fails to disclose or suggest how 
one would proceed to introduce the gene into corn, fails to provide 
any evidence that such an introduction would be successful if 
attempted, and fails to provide any evidence that the gene would 
function appropriately if introduced into corn. 

(6) fertile transgenic maize bearing a tyrosinase gene 
(claims 134, 142) 

The tyrosinase gene is disclosed in the specification at page 

13, line 21, which refers to the article of Katz et al . (Exhibit 

R) . The Katz article relates to the cloning of the tyrosinase 

gene. No disclosure from Katz - relevant to the cloning of the 

tyrosinase gene in corn has been identified, and this article fails 

to in any way teach or suggest to idea of introducing the gene into 

the corn genome. Furthermore, the article fails to disclose or 

suggest how one would proceed to introduce the gene into corn, 

fails to provide any evidence that such an introduction would' be 

successful if attempted, and fails to provide any evidence that the 

gene would function appropriately if introduced into corn. 

(7) fertile transgenic maize bearing a wheat germ agglutinin 
gene (claims 103, 143) 

Lastly, while the subject specification mentions the use of 

the wheat germ agglutinin gene at page 17, line 2, it does not 

refer to a specific reference for the cloning of the gene. 

However, Appellants enclose the exemplary reference of Raikhel et 

al. (Exhibit S) which discloses the cloning of the wheat germ 



agglutinin gene in E. coli. However, as with the previous 
references, Appellants have been unable to identify any teaching 
from Raikhel et al . directed to the introduction of the wheat germ 
agglutinin gene in corn, and this - article fails to in any way teach 
or suggest to idea of introducing the gene into the corn genome. 
Furthermore, the article fails to teach or suggest how one would 
proceed to introduce the gene into corn, fails to provide any 
evidence that such an introduction would be successful if 
attempted, and fails to provide any evidence that the gene would 
function appropriately if introduced into corn. 

AT BEST, IT CAN MERELY BE SAID THAT THE PRESENT INVENTION IS 
"OBVIOUS TO TRY" — AN IMPROPER BASIS FOR CONCLUDING 
OBVIOUSNESS 

At best, it can merely be said that introduction of the known 
genes was "obvious to try." Appellants would direct the Board's 
attention to the case of In re O'Farrell, 7 U.S.P.Q.2d 1673, 1680 
(Fed. Cir. 1988) , which held that in order for a reference or 
references to obviate an invention, it must be shown that the 
reference (s) contains: 

(1) detailed enabling methodology for practicing the claimed 
invention; 

(2) a suggestion for modifying the prior art to practice the 
claimed invention; and 

(3) evidence suggesting that the invention would be 
successful. 

It is submitted that the present references relied upon by the 



Examiner clearly fail to satisfy this tripartite test of O'Farrell. 
In particular, for the reasons discussed above none of the 
references provides a suggestion for combining the teachings of, 
e.gr., references teaching particular genes with the Goldman 
reference, and none provides a reasonable expectation that such a 
combination would be successful, in that for the reasons detailed 
above there is clear uncertainty as to the operability of the 
Goldman technique. Moreover, there is certainly no suggestion for 
modifying these prior disclosures in a manner that would allow one 
to arrive at the invention, and certainly no evidence that any 
particular modification would be successful. 

In In re Vaeck, 20 U.S.P.Q. 1438 (Fed. Cir. 1991), the Federal 
Circuit took the O'Farrell doctrine a step further. In Vaeck the 
Federal Circuit stated that in order for an examiner to make out a 
prima facie case of obviousness two things must be shown: 1) that 
the prior art would have suggested to those of ordinary skill in 
the art that they should make the claimed composition; and 2) that 
the prior art must demonstrate a reasonable expectation of success 
of the invention. Both the suggestion and the reasonable 
expectation of success must be founded in the prior art, not in the 
applicant's disclosure. Here, for the reasons discussed above, we 
have neither. 



IN THAT COMPOSITIONS ARE CLAIMED, EVEN THE EXISTENCE OF AN 
OPERABLE METHOD IS INSUFFICIENT TO SUPPORT AN OBVIOUSNESS 
REJECTION 

The recent case of In re Deuel, 34 U.S.P.Q.2d 1210 (Fed. Cir. 

1995) provides still further strong support for a conclusion of 

non-obviousness. In Deuel, the Court held that with respect to 

claims directed to compositions, the fact that methods for making 

the composition were known or obvious, and even the existence of a 

motivation to do so, were irrelevant to the question of obviousness 

of the composition. In order the find the composition obvious, an 

examiner is required to demonstrate structural obviousness of the 

composition without reliance upon the existence of an obvious 

method for its preparation: 

The PTO's focus on known methods for potentially 
isolating the claimed DNA molecules is also misplaced because 
the claims at issue define compounds, not methods. See In re 
Bell , 991 F.2d 781, 785, 26 USPQ2d 1529, 1532 (Fed. Cir. 
1993). In Bell , the PTO asserted a rejection based upon the 
combination of a primary reference disclosing a protein (and 
its complete amino acid sequence ) with a secondary reference 
describing a general method of gene cloning. We reversed the 
rejection, holding in part that " [t]he PTO's focus on Bell's 
method is misplaced. Bell does not claim a method. Bell 
claims compositions, and the issue is the obviousness of the 
claimed composition, not of the method by which they are 
made." Id. . . . Thus, even if, as the examiner stated, the 
existence of general cloning techniques, coupled with 
knowledge of a protein's structure, might have provided 
motivation to prepare a cDNA or made it obvious to prepare a 
cDNA, that does not necessarily make obvious a particular 
claimed cDNA. "Obvious to. try" has long been held not to 
constitute obviousness. In re O'Farrell , 853 F.2d 894, 903, 
7 USPQ2d 1673, 1680-81 (Fed. Cir, 1988). (emphasis in 
original) 

34 U.S.P.Q.2d at 1215-16. 

It is submitted that the Examiner has failed to meet the 

burden set by Deuel and its predecessors in making out and 



supporting a case for structural obviousness of the claimed 
fertile, transgenic plants. The mere fact that Goldman may 
disclose techniques that may or may not be useful in the 
preparation of transgenic maize plants is submitted to be 
irrelevant under Deuel. The Deuel and Bell line of cases require 
the Examiner to make a prima facie -obviousness rejection without 
reliance upon "general cloning techniques," as admonished by the 
Deuel court. This has not been accomplished in the present case. 

APPELLANTS' BURDEN OF PROOF IS MERELY BY A "PREPONDERANCE OF 
THE EVIDENCE", NOT "CLEAR AND CONVINCING" AS ARGUED BY THE 
EXAMINER 

Although the Examiner concluded that "sufficient, clear, and 
convincing" evidence must be provided that the present invention 
would not result from the routine use of the Goldman et al . 
procedures with other known genes, this is not the appropriate 
burden for Appellants to meet. It is well settled that the proper 
test of a description in a publication relied on as § 102 prior art 
requires a determination of whether or not one of ordinary skill in 
the art would take the disclosure in the publication and combine it 
with his own knowledge of the art., and from the knowledge be put in 
possession of the claimed invention. However, the burden placed on 
Appellants by this type of rejection is to rebut this "presumption 
of operability" of a reference by a preponderance of the evidence. 
In re Sasse, 207 USPQ 107 (CCPA 1980). 
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STRONG SECONDARY EVIDENCE EXISTS IN SUPPORT OF NON-OBVIOUSNESS 
OF THE PRESENT INVENTION 

Appellants would like to review for the Board what is 
considered to be very strong secondary evidence of non-obviousness, 
in the form of various scientific articles evidencing the long felt 
but unresolved need for the invention, the failure of others to 
achieve the invention, and scientific accolades of peers in the 
research community for the ultimate achievement of the present 
invention. 

Applicants will turn first to two review articles of Dr. Ingo 

Potrykus, a well known expert in the field of plant transformation 

(exhibits T and U) . In the former of the foregoing review 

articles, Dr. Potrykus, as late as June 1990, refers to the serious 

difficulties associated with genetic engineering of monocots: 

...my personal experience in working towards the genetic 
engineering of cereals for the last 18 years convinces me 
that we still have serious problems in front of us.... 
It seems to me that we are really not yet close to such 
a situation. 

p. 535, col. 1-2. 

Pessimism pervades Dr. Potrykus' 1989 review article: 

Despite considerable efforts in the genetic engineering 
of plants, and notably achievements in that some species, 
the world's major cereal crops are proving remarkably 
recalcitrant to genetic transformation. 

Exhibit U, p. 269, abstract. 

An article from the journal Science, published shortly after 

the announcement that transgenic, fertile corn has been achieved, 

emphasizes the long-felt but unresolved need, as well as the 

failure of others such as Carol Rhodes ("Corn Transformed", Exhibit 
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V) . The article begins by noting that the achievement of fertile, 
transgenic corn is "the capstone of almost a decade's efforts to 
genetically engineer this country's most important crop," and then 
continues by noting the "years of frustration: and a renewed effort 
to genetically engineer corn begun by Carol Rhodes and her 
colleagues. The article notes that while Dr. Rhodes and her group 
were successful in regenerating transformed corn, their 
"celebrations were short-lived: the resulting plants were 
infertile. " 

The Science article then refers to attempts by the CIBA-Geigy 
group, who were said to have achieved the regeneration of non- 
transgenic corn cells into fertile plants. However, it is pointed 
out that "these techniques, so far, have not worked with 
genetically transformed corn." 

The Rhodes et al . (see Exhibit W) referred to in the Science 
article discussed above is particularly relevant. The Rhodes 
article — which is dated well after the Goldman filing date — 
demonstrated using a later technique that while various genes could 
readily be introduced into corn, of 38 transformed plants derived 
from 10 different cell lines, NONE were found to be fertile 
(Exhibit W p. 206, col. 3) . The Rhodes article is submitted as 
evidence of nonobviousness — it represents the failure of others 
to achieve fertile, transgenic corn, and demonstrates that the 
technical breakthrough of the present invention is not the mere 
introduction of DNA into the corn genome, but the ability to do so 
in a reproducible manner that achieves fertile, transgenic plants 



and transgenic offspring plants. It will be recalled that Rhodes 
et al. were routinely successful using an electric field to 
introduce individual genes into corn cells, but all of the plants 
that were prepared turned out to be infertile. This is a strong 
secondary consideration that must be considered by the Board in 
resolving the obviousness question. 

In the November 1990 issue of Bioworld (Exhibit X) , in an 
article entitled "They Make it Happen in Biotech," Dr. Catherine 
Mackey, head of the DeKalb transformation research team, was picked 
as one of four scientists shaping the biotech industry's growth (p. 
3 6) . The article refers to the achievement of genetically- 
engineered corn as "one of the Holy Grails" of monocot 
transformation. The article then states that monocots such as corn 
"have been the toughest nuts to crack" in agricultural 
biotechnology . 

Similarly, in the March 1990 issue of Agricultural Genetics 

Report, the race for corn transformation is characterized as 

biotechnology's "run for the roses" and "Holy Grail": 

The Bottom Line: Well, more heard from in agricultural 
biotechnology's run for the roses — corn transformation. 
As we have observed earlier in this space, stable 
transformation of maize has always been the Holy Grail of 
agricultural biotechnology. 

Exhibit Y, p. 2. 

In the April, 1990, issue of the Case Reports of the Ct. 

Academy of Science and Engineering (Exhibit Z) , it is stated that 

the "DeKalb plants are the first documented fertile, transgenic 

corn, the result of nearly 10 years of genetic engineering. 
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The Oct. 1990 issue of the ARI Newsletter (Exhibit AA) 
observed that the previous report by Sandoz Crop Protection was 
able to produce only infertile transgenic corn plants. 

In the July, 1990, issue of the Ag Consultant, an article 
characterizes the DeKalb achievement as a "major breakthrough in 
biotechnical research" that "could possibly open up whole new areas 
of plant improvement." (Exhibit BB) . 

The October 1990 issue of Genetic Technology News included a 
special section detailing the "breakthrough" represented by 
genetically-engineered corn. The article lauds the efforts of the 
DeKalb plant genetics research team and states that "now we know 
for sure that it is possible to genetically engineer corn" (Exhibit 
CC) . 

Further, the AG Biotechnology News characterizes the 
development of this technology as "revolutionary" (Exhibit DD, col. 
1) * 

The DeKalb team's efforts in achieving genetically engineered 
corn did not go unnoticed in the lay press either. The Investor's 
Daily characterized this achievement as "an advance that other 
scientists hailed yesterday as a breakthrough" (Exhibit EE) . The 
Rockford Register Star characterized this achievement as "the 
launching point for a genetic engineering revolution" (Exhibit FF, 
col. 1). The Chicago Tribune quoted the Assistant director of the 
Washington University Center for Plant Science as the achievement 
being a "tremendous breakthrough." (Exhbit GG) . Moreover, the Wall 
Street Journal recognized the great achievement represented by the 



achievement of fertile, transgenic corn: 

While scientists have been able to get new genes into 
monocot plants, the plants previously have ended up 
sterile and unable to pass the new genes on in their 
seed , (emphasis supplied) 

Exhibit HH, col. 1. 

In conclusion, it is submitted that the foregoing strong 
evidence of non-obviousness conclusively demonstrates the 
significant achievement represented by the present invention to 
others in the corn industry. From this evidence it is submitted 
that a conclusion of non-obviousness is inescapable. Therefore, 
for each of the foregoing reasons, Appellants respectfully submit 
that the Board must conclude that the invention defined by the 
present claims is patentable over the art. 

The Board is requested to reconsider and overrule the 
Examiner's rejection based upon Goldman, taken alone or in 
combination with the art cited in Appellants' specification. 
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